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Part 1. Linnik equidistribution theorems 

1. Introduction 

Let d > 1 be an integer which, for simpUcity of exposition, we assume 
to be squarefree, and let J^d be the set of integer points on a 2- dimensional 
sphere of radius d^/^: 

^ := {x = (x, y, z) G Z^, x"^ + + = d}. 

The study of is a classical question of number theory. Surprisingly, 
there are interesting results about that have been proved in the last two 
decades, and simply stated problems that remain unresolved. In increasing 
order of fineness, one may ask: 

(1) When is J^d nonempty? 

(2) If nonempty, how large is J^d, and how can we generate points in 

(3) If J^d get large, how is it distributed on the sphere of radius d^/^? 
The first question was studied by Legendre and the answer is: 

Jifd is nonempty if, and only if, d is not of the form 4'^{8b — 1) (a, b GN), 

(or put in other terms, the quadratic equation + y'^ + = d satisfies 
the Hasse principle.) Legendre's proof from 1798 however was incomplet^ 
and the first complete proof was given by Gauss three years later in his 



Disquisitiones arithmeticae GauOl 



The second question is somewhat subtler and its resolution is the con- 
sequence of the work of several people. The fundamental insight however 
come from Gauss work who showed that Jifd is closely connected with the 
set of classes of binary quadratic forms of discriminant —d. 

This relation amounts, in more modern terms, to the existence of a natural 
action of the ideal class group of the quadratic ring Z[\/— d] on the quotient 
S03(Z)\J^. This action is, in fact, transitive (at least if d is squarefree, 
which we assume here) and is faithful if and only if d = 3 modulo 8. An 
exposition of these facts is given in ^ In particular, whereas S03(Z)\J^ 
does itself not have a natural group structure (what would the identity be?) 
the notion of "arithmetic progression" makes sense on S03(Z)\J^. 

A first consequence of the existence of this action is an exact formula 
relating \ J^d\ to the class number of Q{\/—d). From there, Dirichlet's class 
number formula expresses \J^d\ in terms of the residue at 1 of the Dedekind C- 
function of that field, and Siegel's theorem controls this value quite precisely. 



^Legendre assumed the existence of primes in arithmetic progressions, which was proven 
about 40 years later by Dirichlet. 
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yielding 

(1.1) l^rfl =dl/2+o(l)^ 

The third question is the main focus of this paper; whereas it is not at first 
clear that is a worthy successor to the first two, its investigation has proved 
very rich. Progress on it has been entwined with the study of modular L- 
functions, as well as to the study of dynamics on homogeneous spaces. The 
first significant answer regarding this question are due to Y. V. Linnik who, 
in the late 50's, proved amongst other results the following. 

Theorem 1.1 (Linnik). As d ^ +oo amongst the squarefree integers sat- 
isfying d = ±1(5), the set 

becomes equidistributed on the unit sphere with respect to the Lebesgue 
probability measure. 

In explicit terms, that means that if 

redoo : ^ 1-^ 5"^ 

denotes the "scaling" map x i— t- d^^/^.x, then for any measurable subset 
C S*^ whose boundary has Lebesgue measure zero 

^^^^^ = area(17)(l + o(l)), d ^ +oo; 

(we take the normalization area(S'^) = 1). 

Linnik obtained this by an ingenious technique which he called the "er- 
godic method." This method was generalized later (notably by Linnik's 
student, Skubenko) to establish several remarkable results about the distri- 
bution of the representations of large integers by (integral) ternary quadratic 
forms [Lin68| . Until recently, Linnik's ergodic method remained surprisingly 
little-known, although simplified treatments were given by a number of au- 



thors Tet83,Mal84 ; one possible reason is that the method did not fit into 
ergodic theory as the term is now usually understood, i.e. dynamics of a 
measure-preserving transformation. 

Removing the constraint d = ±1 modulo 5 proved to be very difficult, 
and was resolved only thirty years later by W. Duke |Duk88 ; see ^ 11.1 for 



discussion. 

The aim of the present paper is to revisit and explain in a slightly different 
language Linnik's original approach and to present further refinements which 
do not seem present in Linnik's work and do not seem accessible to the other 
known methods. We have endeavoured to simplify the original proofs of 
Linnik as far as possible, and have favoured explicitness and directness over 
generality and (more regrettably) over more conceptual approaches. 




Figure 1. d-^/'^M'd C 5^ for d = 101, 8011, 104851 
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1.2. A refinement of Theorem Consider, for x € S"^ and p > 0, the 
spherical cap Q{x, p) of center x and radius /) > (i.e. the ball of center x 
and radius p with respect to the standard Riemannian metric on S''^). Define 
the deviation of O to be 

H (0\ 1 Ired^'(^)l T 
devrf Q = — — — 1. 

area(iZ) \J^d\ 

Theorem I is equivalent to the fact that for any a; G S"^, p > 0, 

devrf(il(x, p)) — as d — )■ oo. 

We shall establish the following refinement of the prior result: 

Theorem. Fix 6,r} > . For any p ^ d~^^^~^^, the measure of x £ 5^ for 
which |devrf(ri(x, p))| ^ rj, tends to 0, as d oo as above (i.e. squarefree 
and = ±1(5);. 

Note that this theorem implies Theorem |1.1[ Moreover, in view of (1.1), 
it is easy to see that the exponent 1 /4 in the above statement is optimal. It 
would, of course, be desirable to replace ^^tends to as d ^ oo" by ''^equals 
for sufficiently large d." This seems extremely difficult. On the other hand, 
by different methods, it has been shown that this holds in a restricted range 
p ^ d~'^ for K > some absolute constant (cf. ^ 11 ). 

1.3. A modular analog. It is also possible to prove versions of the above 
results where, instead of studying the position of J^f^ on the sphere, we study 
the congruence properties of points in J^^. For q an integer coprime with d, 
let J^dio) denote the "sphere modulo g" 

M'd{q) := {x= (x,y,z) e (Z/gZ)^ x"^ +y'^ + z'^ = dmodq}. 

Theorem (Linnik). Let q he a fixed integer, coprime with 30. As d ^ +oo 
amongst the squarefree integers satisfying d = ±1(5), {d, q) = 1, the multiset 

{x (modg), xG^d} C jrd(g) 

becomes equidistributed on Mdio) with respect to the uniform measure. 

In explicit terms, that means that if 

red, : Md ^ J^d{q) 

denotes the reduction modulo q map, then, for any x G J^diQ), 

|red"Vx)| 1 



We prove also the following refinement: define the deviation at x £ J^diQ) 
to be 

I red- 1 fx) I 
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Theorem 1.4. Fix z/, 5 > and suppose that q ^ d^/'^^" and (g, 30) = 1. 

The fraction ofx.£ J^dil) for which |dev^(x)| > 6 tends to zero as d —)• oo 
with d = ±1(5) . 

A qualitative consequence of this Theorem is that: 
As long as q ^ d^/^~^ , almost any solution to sP' + + = d with 
(x, y, z) € {'L/q'L)'^ , can be lifted to a solution + y'^ + z^ = d, (x, y, z) ^1? . 

Again, it is natural to surmise that this is true for all solutions; but this 
appears to be a very difficult problem. It is also interesting to consider 
numerics related to this issue. 



In this paper, we shall discuss the proof of Theorem 1.4 in some detail; 
all the other results may be obtained by modifying that proof; see ^ 10 for a 
discussion of how to carry this out. 



1.5. Plan of the paper. In ^we describe the main steps of the proof of 
Theorems |1.1| and 1.4 for simplicity we focus on the second theorem and 



briefly indicate the modifications necessary to handle the first. Each of these 
steps is then explained in detail in the remainder of the paper: 

- Part 1 concerns those parts of the proof that are most conveniently 
presented in classical language: 

- In ^ we describe the natural action of the class group on the 
quotient S03(Z)\J^ and make it rather explicit. For that pur- 
pose, it is particularly useful to express everything in terms of 
quaternions. Presentations of similar material can be found in 
[Ven22 , Ven29] and [She86j ; our approach is slightly different. 



- In ^ we establish the important "basic lemma" of Linnik. 
Part 2 - comprised of ^ ^ and ^ is concerned with transferring 
some of the key objects - J^d{Q) ~ in terms of adelic quotients. 
This will be a key tool in proofs. 

Part 3 is concerned with the part of the proof that is related to 
expanders. 

- In ^ we prove that Jifdig) ~ endowed with a graph structure 
that we shall describe - is an expander graph. 

- In ^ we recall some basic facts from the theory of random 
walks on expander graphs. In particular we give a self-contained 
proof of a large deviation estimate (for non-backtracking paths) 
on such graphs. 

Part 4 is concerned with miscellaneous extensions of the main topics. 



- In ^ 10 , we discuss various possible extensions of Linnik's ergodic 



method. 



- In Q 1 we recall the alternative approach of Duke and others 
to these equidistribution problems, which is based on harmonic 
analysis and more precisely on the theory of automorphic forms. 
We also compare the two approaches. 
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2. An OVERVIEW OF THE ERGODIC METHOD 



We now present an overview of the proof of Theorem 1.4 We will try 
to isolate the main steps of the proof, each one of which contains some key 
results of a more general mathematical interest. In the present section, we 
treat each of these results as a black box; in the latter part of the paper, we 
"open the black boxes" one by one and provide complete proofs. 

2.1. Assumptions and notation. Throughout the paper we shall make 
the following assumptions: 

(1) d will always denote a squarefree integer, not congruent to 7 modulo 
8, and congruent to ±1 modulo 5. 

(2) q will always denote an integer prime to 30. 

Remark. One could replace 5 by an arbitrary fixed prime p and the con- 
dition d = ±1 modulo 5 by the assertion that p is split in Q{\/—d). The 
assumption that q is prime to 30 is for convenience and could be removed 
entirely. 

Write K = (Q{\/—d); we denote by the ring of integers of K, and by 
Pic(^i^) the ideal class group of i^K- We also fix a square root of —d in K, 
and denote it by \/—d without further commentary. 

2.2. Some natural quotients. The problems considered in the introduc- 
tion admit "obvious" symmetries owing to the evident action of S03(Z) on 

S"^ or Jifdiq)- We denote the corresponding quotients by 

J^rf = S03(z)\jrrf, = S03(z)\^,(g), s5 = S0s{z)\s\ 

and denote by x the orbit S03(Z)x of any element in the above sets. 

For the purposes of our main theorems, there is no essential difference 
between working with J^d, S'^ or J^d{q) and working with J^d, S'^ or J^dil) 
(since S03(Z) is finite). It is often conceptually clearer to consider the 
question of how J^d becomes distributed in Mdil) or than the similar 
question for and 5^ or J^dil)] this becomes clear when considering 
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these problems for other ternary quadratic forms, especiahy indefinite forms. 
However, the latter formulation being slightly more classical, we will make 
the (slight) extra effort required to state theorems on M'^- 

Recall that if G is a group, a homogeneous space for G is simply a set X 
on which G acts transitively (i.e. for any x, x' G X, there is g € G such that 
g.x = x'). In such a case, the stabilizers of the elements of X under this 
action are all conjugate. 

Proposition 2.3 {J^d is a Pic(^i4')-homogeneous space). Let d > 3 be a 

squarefree integer not congruent to 7 mod 8. Then there exists a natural ac- 
tion of Pic{ff k) on J^d, making into a homogeneous space for Pic^^k)- 
The stabilizer of any point is trivial if d = 3 modulo 4, and the order 2 
subgroup generated by a prime above 2 if d = 1,2 modulo 4. In particular, 

\J^d\ = 24| Pic(^x)| when d = 3 (8) 

and 

\J^d\ = 12|Pic(^ii-)| whend = 1,2 (4). 



We prove a slightly more precise version of this statement in ^3.1 and 
explain its adelic manifestation in §6.1[ 

Given a C an ideal, we denote by [o].x the action of its corresponding 
ideal class on some element x G J^d- 

2.4. The Pic(^i^)-action. While it is possible to describe explicitly the 
action of Pic(^i4:) on (or at least the action of the prime ideal classes), 
for the proof of the main results we will only need the action of the primes 
above 5. Since d = ±1(5), the prime 5 splits in K, which is to say that the 
principal ideal b^x factors into a product of two prime ideals 

We shall now realize explicitly the action of [p] and the group it generates. 

Let A,B,C be, respectively, rotations by angles acos(— 4/5) around the 
x, y, z axes. They are described by the matrices: 





Proposition 2.5 (The action of a prime ideal). For x G J^d, exactly two of 
{Ax, A-'^x, Bx, B-^x, Cx, C'^x] 

belong to M'd- The classes of those two points in are [p]-x and [p'].x = 
[p]-\x. 

Proof. Since multiplication of an element of J^d by either of the matrices 
A,B,C or their inverse produce vector with rational coordinates with de- 
nominator equal to 1 or 5, the first part of the proposition can be verified by 
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direct computation on the set J^di^), which is to say, the set of solutions to 
+ + = d in (Z/5Z)'^; one just has to check that for each x in J^(5), 
there are exactly two choices of M G {5 A, 5A~^, 5B, 5B~^, 5C, 5C~^} satis- 
fying Mx = 0. The seco nd p art will be proved in §3.12 □ 



Note that Proposition |2 . 5| in fact defines a lifting of the action of [p] from 
J^d to Jifd- This lifting is, in fact, rather less canonical than the Pic{^k) 
action on J^^- 



2.6. Trajectories. We can use Proposition 2.5 to determine distinguished 
trajectories in J^. Write 

A :={A,A-\B,B-\C,C-^}. 



To start with, pick any x G J^d'-, now by Lemma [2.5[ there are precisely two 
matrices in ^5 - say wi, wo - such that wix and Wq^x both belong to J^d- 
Denote wix by xi, and Wq^x by x_i. Similarly, there is a unique choice of 
matrix W2 S A5 such that W2 7^ [wi)^^ and 102X1 belongs to J^d] we denote 



W2X1 by X2. In this way, repeated application of Lemma 2.5 gives rise to a 
sequence (xj)jg^ in J^d such that xq = x and for any i, 

Xi £ J^d, Xj+i G {^Xj, ^"^Xj, Bxi, B^^Xi, Cxi, C'^Xi] - {xj_i}. 

Alternatively, this string can be represented by the data of x = xq and 
an infinite (necessarily periodic) word Wx = {wi)i,zz, in the alphabet ^5, 
satisfying, for any i, 

(2.1) Wi+i ^ (wi)~\ Xi = WiXi-i e J^d- 

A word satisfying the condition Wi+i 7^ {wi)~^ is called reduced; it is easy to 



see that M^x is the unique reduced word satisfying (2.1), up to the "switch 
directions" transformation given by switching Wi and w^^-, or, equivalently, 
switching Xj and x_j. We refer to the sequence (xi)jgz as the trajectory of 

X. 

The equivalence of this trajectory to the one defined by the action of [p]^ 
is explained in §3.10 



2.7. An example. For the sake of concreteness, we include an explicit ex- 
ample. Take d = 101. In this case, = 168 and |Pic(i^ii-)| = 14. The 
points of \J^d\, up to the action of S03(Z), are: 

(10,1,0), ±(9, 4,2), ±(8, 6,1), ±(7,6, 4) 

The trajectory containing (10, 1,0) is: 

(10, 1, 0) 4 (-8, 1, -6) (7, 4, -6) (-2, 4, 9) 
(4, -2, 9) 4 (4, 7, -6) ^4' (1, -8, -6) ^4' (1, 10, 0) . . . 



VF(io,i,o) = • • • B*C-^B-^C-^AC-^A-^ 



with B* = wi. 
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Figure 2. The trajectories of (10,1,0) (red) and (6,1,8) 
(green) within 

We note that after seven steps (and thus, after any multiple of seven steps) 
the trajectory returns to the S03(Z)-orbit of (10, 1,0). This periodicity of 
order 7 reflects the fact that the class of a prime ideal above 5 has order 7 
in the class group of Q(-v/— 101), which is cyclic of order 14. 

2.8. The lengths of the trajectories. We should emphasize that it is 
certainly possible, and in some sense probably typical, for [p]^ to be all or 
most of Pic(i^i<:). 

What we know in this direction is rather minimal. On the one hand, it is 
easy to see that 

|[p]^|»log(d); 

while this lower bound goes to oo with d, it remains quite small compared 
with the size of |Pic(^i^)| = As far as we know, it is unknown 

whether there exist infinitely many squarefree d = 1(20) such that |[p]^| is 
greater than some positive power of d. 

These considerations are, of course, related to the question: 



for which d does the process described in Proposition 2.5 
traverse all points in ? 

This condition on d turns out to be equivalent to the condition that the prime 
ideal p above 5 and the 2-torsion ideal class above 2 generate the ideal class 
group Q{\/—d). The question of whether this happens for infinitely many d 
is an interesting and seemingly difficult one; heuristically it is reasonable to 
suppose that this happens a positive fraction of the time. It is analogous to 
Gauss's famous question: 

do there exist infinitely many d > for which Q{^/d) has class number 1 ? 
The prime above 5 plays the role in our situation that a real place does 
in Gauss's problem; the period of the process above is analogous to the 
regulator of a real quadratic field. 
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2.9. Trajectories on the graph Md{<l)- We shall now begin to exa mine 



solutions to + + = d modulo a positive integer q, as in Theorem L3 
We shall suppose, as in that theorem, that q is relatively prime to 30. 

Under this assumption, the matrices = {A^A~^,B,B~^,C^C~^} de- 
fine elements in SO^{'L/ qTj) and act on Md{q)', this endows ^dio) with a 
structure of a 6-regular (undirected) graph by joining each x G to 

Ax, A-^% Bx, B-% Cx, C^^x. 

By an abuse of notation we shall also refer to this graph as J^diQ)- More pre- 
cisely, it is a multigraph - we allow multiple edges between pairs of vertices, 
and edges joining a vertex to itself. 

Now to each point x G J^d, we may associate a well-defined marked path 
on the graph J^dig) in the following way. Given x G J^d, we constructed in 



^2.6 a sequence (xj)jgz of points in Jifd-, the trajectory of x, which is well- 
defined up to the substitution Xj (-)• x_j. We denote by 7x the reduction of 
this trajectory to J^dig)', the data of 7x is the sequence of vertices (xj)^^^ of 
J^dig) (xi = redg(xi)), together with a marked basepoint xq and a choice, 

— — (i) 
for each i, of an edge joining Xj to Xj+i. We denote by 7x the segment of 

this path of length 2i, centered at the marked vertex. In other words, 

7x = (X_£,X_^+1, . . . ,X£_1,X£). 

We may refer to it as the truncated trajectory of length 2£. 

Note that the trajectories arising as 7x are not completely arbitrary paths 
on J^dig)', the condition Wi+i ^ from (2.1) implies that 7x never tra- 



verses the same edge twice in succession. (This does not, of course, forbid 7x 
from traveling from xtoy and then back to x; it just has to use two distinct 
edges joining x and y.) A non-backtracking path in J^dio) is a (marked) path 
which never traverses the same edge twice in succession; equivalently, it can 
be defined by the data of a marked point xq, and a word VFx = {wi)i^z 
satisfying Wj+i / Wi^^] the vertices be determined inductively by the 

rule that Xj+i is the vertex arrived at by following the edge labeled Wj+i 
from Xj. 

For instance, \i d= 101, q = 7,P = {7, 4, —6), we have (see ^2.7[ ) 
7?^ [(3,1,0)] A [(-1,1,1)] [(0,4, l)r 

[(-2,4,2)] [(4,-2, 2)] ••• 

Here * denotes the marked vertex, and we have used square brackets [. . . ] 
to denote reduction modulo 7. 

Proposition 2.10 (Shadowing lemma). The sequences and Wx' coin- 
cide for —£+ 1 ^ i ^ £ if and only if = ibx' modulo 5^ . In particular, 
7^^^ = 7^^^ only i/ x = ibx' modulo qb^ . 

This proposition states roughly that if two points have their trajectory 
agree for a long time then their initial points have to be 5-adically close 
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as well as congruent modulo q. ( In particular, the trajectories 7x, 7x' are 
equal if and only if x = itx'.) 

The "if" part is easy; the "only if" is not much harder, although this is 



less apparent from our definitions. Proposition 2.10 is proved in ^8.3, using 
geometric properties of the Bruhat-Tits building of S03(Q5) ~ PGL2(Q5). 

Our aim is to show that the integral points and their associated tra- 
jectories {7x,x G J^d} are well distributed, in some sense, on the graph 
J^diQ)- (In interpreting this statement, one should imagine that q is fixed, 
or not increasing too quickly, whereas d — >■ oo.) To obtain this, we shall use 
Proposition 2.10 in conjunction with: 

Proposition 2.11 (Linnik's basic Lemma). Let e G Z such that \e\ < d. 
The number of pairs (xi,X2) G with dot product xi • X2 = e is <Ce d^ . 

This is proved in Q It corresponds to the "basic lemma" in Linnik's 
ergodic method and is in a sense, a generalization of the well known bounds 

r(d) = ^ 1 «, d^ r2(d) = Yl 1 «^ 

ab=d a?+b'^=d 

Indeed, bounding the divisor function T{d), or the number of representa- 
tions r2{d) of d as the sum of two squares, amount to bounding the number 
of representations of the rank 1 form dx^ by the binary quadratic forms 
Q{x, y) = xy and Q{x, y) = x"^ + y^ respectively. By comparison. Propo- 
sition |2.11 concerns the number of ways to represent the rank two form 
dx? -|- exy -|- dy^ by the rank three form x^ ^y^ ^ . 

Proposition 2.12. Let Tj{d,i,q) be the number of pairs (xi,X2) G with 
7x ^ = 7^^^ • Then 

i, q) I JTrfl +d'(^l + 
for any e > 0. 

Proof. Indeed, 7x^^ = 7^^^ implies that x it x' = {q5^) for some choice of 
±; hence either + + = (g^S^^) or (x - x').(x - x') = (g^S^^) 

Since x.x = x'.x' = d, we have 

2x.x' = ±2d mod g^S^^ 

in either case. Thus the number of pairs (x, x') such that 7^^^ = 7^^"* is 
bounded by 

l^rfl + ^ ^d, x.x' = e}\ «, \J^d\ +d'(l + ^, 

\e\<d ^ ^ 

e=±d{q^5^') 

□ 
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Proposition 2.13 {^d{<l) is an expander). For any q coprime with 30, 
the graph J^dio) is connected and non-bipartite. In particular, its adjacency 
matrix, A{M'(i{q)), has a spectral gap and more precisely, if 

Ai = 6 ^ A2 ^ . . . \M-i{q)\ ^ -6 
denote the eigenvalues of A^J^d^q)) , then 

|A,| ^ 2V5, j / 1. 



This is proved in Section ^8.4 using the adelic description of Jifd{q). In- 



deed, the graphs J^dig) are closely related to the original Ramanujan graphs 



of Lubotzky-Phillips-Sarnak[LPS88 . The content of this assertion is equiva- 
lent to the (optimal) Ramanujan bound on the Fourier coefficients of weight 
2 holomorphic forms of level up to 18. g^. Thus, the existence of a spectral 
gap follows from any nontrivial bound for these Fourier coefficients - for 
instance, the works of Kloosterman or Rankin. 



The bound on the spectral gap in Proposition 2.13 says that the family 
of 6-valent graphs [J^diQ]) iq^—i form a family of expander (in fact 



Ramanujan) graphs as g — )• 00. We refer to Lub94 for an extensive and 
motivated discussion of expander graphs, their properties, applications, and 
their construction via automorphic forms. 

We shall use the fact that Md{q) is an expander through: 

Proposition 2.14 (Large deviation estimates). Fix r],e > 0. For any subset 
B C Md{q) with \B\ ^ rjMdiq), fraction of non-backtracking paths 7 of 
length 2i satisfying: 

\jnB\ \B\ 



2e+l \J^d{q)\' 
is bounded by ciexp(— C2^), where ci,C2 depend only on e,r]. 

By I7 n B\ we mean the number of i G [— ^, ■■■,(] such that the vertex Xj 
of 7 is contained in B; in other words, the "amount of time" 7 spends in B 
if one imagines moving along the trajectory at a constant speed. It is then 
natural to compare the portion of time spent by a path in B (i.e. the ratio 
\-^r^B\/{2^ + l)) with the probability of being in i3 (i.e. \B\/\M'd{q)\)- Large 
deviation estimates show that with high probability, long paths spend the 
right amount of time in any large enough subset \B\. 

Such estimates, first proved by Chernoff for complete graphs, are a well- 
known and useful tool in different contexts; for instance it has been fruitfully 
applied in computer science (see [IILW06| ) . We were unable to find a ref- 
erence in the existing literature for the particular version we need here (a 
large deviation estimate for non-backtracking walks), and so give a proof 



from first principles in g9.2| (Proposition 9.4) 



2.15. Conclusion of the proof. We conclude this section by explaining 
how Propositions 12 . 101 12. 1 11 and 12.141 together imply Theorem 11.4 
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(2.2 



Let Bs be the set of x G J^dig) such that 

|red-i(x)| 



devrf(x) 



1 > 5. 



\^d\/mq)\ 

Suppose that I^B^I ^ r]\JifdiQ)\- We will derive a contradiction for fixed 5,1] 
and large enough d. A similar bound applies to the set of x for which 



devrf(x) < —6; taken together these yield Theorem 1.4 

If ;S is a subset oiMd{q), and 7^^^ is the trajectory of an x chosen randomly 

- with respect to counting measure - from J^, the expected size of 7x R ;S 
is just the sum over i G [—(. . . .£] of the probability that Xj lies in B; this 
probability is |red~^(0)|/| independently of i. In other words, 



(2.3) 



\l^^r^B\ 



E 



|red-i(^)l 



We shall take B = Bs and choose i so that 



(2.4) 



1 



d\- 



Note that we've used here the hypothesis that q ^ d^^'^ 



From (2.2) and (2.3), the average value of 



exceeds 



2£ + l 



as X ranges over 



r(l + '5) ^ 



6rj. 



Since ^"''^^^^^^ ^ 1 for every x, the number of x E for which 



(2.5) 
is at least 



2^ + 1 



> 



1^^(9)1 



+ <5r//2 



5rj 



1/2-e 



for any e > (the last bound following from (1.1)). 



Let M be the number of non-backtracking marked paths on J^d{<l) sat- 



isfying (2.5). Given that i is chosen so that (2.4) is valid, we see from 



Proposition 2.12 that the number of pairs (x, x') yielding the same trajec- 
tory 7x^^ on M'dil) is not much larger than the number of diagonal pairs 
\M'd\: i.e. is bounded by <Ce d^/'^+^. It follows that 

(2.6) M ys>e,5,v 

for any e > 0. 

On the other hand, we also have an upper bound for M. The total number 
of non-backtracking marked paths of length 2i on Mdil) is on order of 
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5^^ ""^1=^(9)1 d}/"^^^ . Proposition 2.14 says that, of these, the proportion 



which satisfy (2.5) is at most d for some r 



t{5, -q) > 0; so 



(2.7) 



which yields a contradiction for d sufficiently large. 

2.16. Hecke trees on the sphere: an idea of the proof of Theorems 
1.1 and 1.2, We briefly sketch the proof of Theorems |1.1| and 1.2 The 
archimedean analogue of Proposition |2 . 13 is the fact that the Hecke operator 
on L^{S^) 

n.f: Yl ^(^•^) 

has a spectral gap: the eigenvalue 6 has multiplicity one and any other 
eigenvalue A satisfies 

|A| ^ 2^/5; 

this again is a consequence of Deligne's bounds for Fourier coefficients of 
holomorphic modular forms. 

Let T5 be the set of reduced words in the alphabet A5, which has the 
structure of a rooted 6-valent tree. To any x in S*^, one associates the 
infinite graph 75.x C 5^, which is called the Hecke tree. The most direct 
consequence of the spectral gap property is that 75.x is dense in S'^ and even 
equidistributed: more precisely, uniformly on x S S^, the set {W^.x} where 
We ranges over the reduced words in T5 of length i becomes equidistributed 
as ^ — )• +00 LPS86| . A more refined consequence is a large deviation esti- 
mate for non-backtracking marked paths of length 2i -uniform in the origin 



of the path-which is proven along the same lines as Proposition 2.14 



To prove Theorem 1.2 one proceeds from here as in the proof of Theo- 
rem 1.4, replacing conditions like "two trajectories stay equal mod q for a 



long time" with "two trajectories stay near each other on the sphere for a 
long time." 



Part 2. Classical theory 



3. The action of the class group on ^ 



We present, in Proposition 3.5, a precise v ersio n of the homogeneous 
space structure on discussed in Proposition 



2.3 



As we have remarked, 



the basic ideas here go back to Venkov Ven22 Ven29 and in some sense to 

Gauss. 

It will be convenient to modify, slightly, the definition of in the case 
when d = 3 modulo 4. Let S03(Z)~*' be the index-2 subgroup of S03(Z) con- 
sisting of matrices which act on the coordinate lines via even permutations. 
Set 

~, ^ ('S03(Z)+\^d , if d= 1,2 mod 4 
I S03(Z)\J^ ,if(i = 3mod4 
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Figure 3. The first three layers of the Hecke tree at (10, 1, 0) 

Thus, and are equal when d = 3 modulo 4; otherwise, the former is 

a double cover of the latter. 

We shall show that in fact has the natural structure of torsor for 
Pic(^i^). Recall that a space X homogeneous for the action of some group 
G and for which the stabilizer of some (hence any) point is trivial is called 
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Figure 4. The trajectory of (10, 1,0) within its Hecke tree 

a principal homogeneous space or torsor for G. In that case, for any x € X, 
the map 

g ^ G ^ g.x ^ X 

for any given x ^ X provides an identification of G with X as G-spaces. 
There is a priori no canonical way of choosing x; thus, one may think of a 
torsor as a set endowed with many different identifications with G but, in 
general, with no canonical one. For instance, the set of nth roots of 2 is a 
torsor for the group /in of nth roots of unity. 



3.1. Quaternions. We first recall the following classical facts. 

Let B be the Q-algebra of Hamilton quaternions. For x = u + a.i + b.j + 
b.k G B, the canonical involution is noted x = u — a.i — b.j — b.k, the reduced 
trace tr(x) = x+x = 2u and the reduced norm Nr(x) = x.x = u^+a^+6^+c^. 
Let B(°) denote the space of trace-free quaternions (the kernel of tr) also 
called the pure quaternions. The space B^*^) endowed with the reduced norm 
is a quadratic space and the map 

(3.1) {a,b,c) ^ a.i + b.j + b.k 

is an isometry between the quadratic space (Q^, + 6^ + c^) and (B(''),Nr). 
In the sequel, we will freely identify Q'^ with B^'^-', and, in particular, consider 
the elements of as trace-free quaternions. 

We denote by B^, B^ and PB^ = B^/Z(B^) respectively , the group of 
units of B, the subgroup of units of reduced norm one, and the projective 
group of units; these define Q-algebraic groups, and the action of B^ on 
B(0) by conjugation induces a covering and an isomorphism of Q- algebraic 
groups 



Vig80 Th. 3.3]: 



(3.2) Z(B^) -^B^ ^PB^ ^80(0^ + 62 + c2). 
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3.2. Integral structures. Let B(Z) denote the ring of Hurwitz quater- 
nions, 

B(Z) = Z[i,j,k, ]; 

It is well known that the ring of Hurwitz quaternions, endowed with the 
reduced norm Nr, is eudidean: for any y,q G B(Z) — {0}, there is x, r € B(Z) 
such that Nr(r) < Nr((7) and y = qx + r ( [SamTO §5.7]). This implies that 



any left (or right) B(Z)-ideal is a principal ideal: any finitely generated left 
(resp. right) B(Z)-module / C B(Q) is of the form 'B{Z)q (resp. gB(Z)) for 
some q S B(Q); moreover any subring of B(Q) which is finitely generated 
(as a Z-module) is conjugate to a subring of B(Z); in particular B(Z) is a 
maximal order of B(Q) and any maximal order in B(Q) is B(Q)^ -conjugate 
to it. 

Finally, under (3.1 ), the lattice Z'^ C Q'^ becomes identified with the trace 



free integral quaternions, 

B(0)(Z) = BW(Q)nB(Z). 

In particular, we obtain a map B(Z)^ — J- S03(Z) whose image is the index 
two subgroup S03(Z)+ and, similarly, (B(Z) (gj^Zp)^ — )• S03(Zp) (which is 
surjective unless p = 2). 

While our primary interest will be in the Hamilton quaternions and sums 
of three squares, the reader will observe that much of what we state here and 
below is valid for more general quaternion algebras endowed with a maximal 
order. As this may be useful for readers interested in representations by 
other ternary quadratic forms, we will write several of the intermediate 
steps in more generality. 

3.3. Construction of representations using ideal classes. As a first 
example of the usefulness of the quaternion, let us show how to deduce the 
Gauss-Legendre theorem from the Hasse-Minkowski local-global principle. If 
(i > is not of the form 4"(86 — 1), then, for any prime p, d is representable 
as a sum of three squares in Z^ and since d is positive, d is also representable 
over M. By the Hasse-Minkowski theorem (cf. jSer73| [Thm. 8, p. 41]), there 
exists x = (a, 6, c) G Q'^ such that + 6^ + = d. Let x = ai + bj + ck; 



then x'^ = —d so the ring Z[x] = Z + Zx is finitely generated. By ^3.2 there 
is g G B(Q) such that qZ[x]q-^ G B(Z) so that y := qxq'^ G BW(Z) is 
integral and satisfies y"^ = —d. 

More generally, the above scheme together with the group of ^i^-ideal 
classes makes it possible to generate plenty of new integral representations 
from a given one. 

Any element x G yields an embedding of ^{^/—d) into B(Q): indeed 
x^ = —d and thus y/^^ i— )• x defines an embedding 

i^:K^ Q[x] C B( 



^Indeed, given R such a subring, i?B(Z) is a right B(Z) ideal, so of the form _RB(Z) = 
gB(Z) and q-^ Rq C g-'7?i?B(Z) = g-ii?B(Z) = B(Z). 
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This embedding is integral in the following sense: let 

then i~^{ffx) = is the ring of integer^ of K. 

Now, given such a x and given an ^/^-ideal, /, we can also construct a 
new integral representation y E from x and I: the finitely generated 
Z- module B(Z)/,^(/) is a left B(Z)-ideal, so of the form B(Z)g and then 

y = qxq-^ C B{Z)LM)xq'^ = B{Z)L^{xI)q-^ C B{Z)L^{I)q-^ = B(Z); 

moreover if / is replaced by X.I = I.X A G , q may be replaced by 
q' = qi-x{X) and q'xq'~^ = q' ix{X)xLx{\^^)q' ^ = y- Notice that q is defined 
only up to multiplication on the left by an element of B ^ (Z) this implies y is 
well defined up to B^ (Z)-conjugacy: in view of the isomorphism B^(Z)/ it 
1 ~ S03(Z)"'", we obtain for x G fixed, a well defined map 

[/] G Pic(^i^) ^ 

Let us see that this map is a Pic(^i^)-torsor structure on 

3.4. For each pair of elements x, y G we define the abelian group 

kx-,y = {A G B(Z) : x\ = Ay}. 

Then A^..^^ has the structure of a module under 0'x (via /U G : A i— )• ;uA) 
and also under &y (via v ^ ffy : \ ^ Both structures are torsion-free, 
and thus locally free. 

In fact, in view of the isomorphism S03(Q) = PB^(Q), there exists a 
q G B^(Q) such that q~^xq = y. (By Witt's theorem, any two elements of 
of the same length can be rotated into one another.) One can then write 
Ax^y as Q[x]qn B(Z). It follows that 

Ax^y Q = Q[x]q, 

and thus that both module structures on A^-^y are, indeed, locally free of 
rank 1. 

We note that the map M — )■ M (8)^^ Ax^y gives a map from ^^^-modules 
to ^y-modules; this induces an isomorphism Pic(^a;) — )• Pic(^y), which is 
in fact the one induced by the isomorphism 0'x = ^y sending x to y. We 
denote by [Aj,_>j/] the class of Ax^y in Fic{ffx) = Pic(^i4'). 

Proposition 3.5 (Torsor structure on The set .3^^ has the structure 

of a torsor for P[c{^k) in which the unique element of Ficl^x) mapping x 
to y is given by [Ax^y] ■ 

This action o/Pic(^/^) descends to Md (obvious if d = 3(mod4)J, and 
for (i = 1, 2(mod4) the stabilizer of any point of Md is the order 2 subgroup 
generated by a prime above 2. 

More precisely, for any x,y,z G J^d, 

■^It is an order containing Z[-\/—d], integrally closed at 2, because the local order B(Z)(8)z 
Z2 contains all elements of B ®q Q2 with norm in Z2. 
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A. A^_5>y ^y^z is isomorphic to Ax^z os an {^x, ^z)-bimodule; 

B. The class of Ax^y is trivial if and only if x and y are identified in 

C. For every x and every g £ Fic{i?K ), there exists a y such [Ax^y] = g. 

D. If d = 1,2 modulo 4, and x ^ y € project to the same element 
of Jifd, then [Ax^y] is the ideal class of a prime above 2. 

Note that parts A-D imply the first statements. 



In order to prove Proposition 3.5 , we shall need a study of the correspond- 



ing local problem, which we undertake in §3.6[ 

3.6. Some local analysis. Let i? be a quaternion algebra over Qp, let ^ be 
a maximal order of B and ff^^^ C B^^^ be the lattice of trace zero elements 
in ff. As above, conjugation by element of B^ induces a surjective map 

5^ ^ SO(s(°),Nr) 

under which ff^ maps to SO(^(°),Nr). Let ff^^^ C ff^ be the group of 
norm-1 units. Let d G Zp be squarefree (that is, ordp(d) ^ 1) and let 
0'{o,d) (2 ^(0) be the set of elements of with trace and norm d; 1^(0'°') is 
obviously stable under the action of i?^ by conjugation. 

We fix an element x of ff^^''^^ and, as above, write Ax^y (or, when no 
confusion is likely, just A) for the set of A E such that 

(3.3) xA = Ay. 



The solutions to (3.3) in B form a vector space of dimension 2, so A is a 



free Zp-module of rank 2. 

Proposition 3.7. The action of ff^^^ and on ^C'*^) can he described as 
follows. 

- Suppose B is a division algebra. Then there are two orbits of ff^^^ on 
ff{Q,d) ^ yjfi^Qli interchanged by conjugation by any element of ff^ 
whose norm is not in Nr(Qp[a;]). In particular, the special orthogonal 
group of 0'^^'> acts transitively on . 

- Suppose B = M2{Qp). Then: 

— If p ^ 2 and p does not divide d, the action of G^"^^ on ^C-'^) is 
transitive. 

— Otherwise, there are two orbits of G^^^ on ^C^''^); they are in- 
terchanged by , unless p = 2 and d = 3 mod 4. 

Proof. Suppose is a division algebra. Then G is the unique maximal 
order, and consists of all elements whose norm lies in Zp. Let A be a nonzero 
element of A; then y = A~^xA. Note that conjugation by A is the desired 
isometry of G^^'^^ relating x to y. 

If there is an element a E Qp[x] with Nr(a) = Nr(A), then q^-'^A is an 
element of A of norm 1; conversely, any element of A of norm 1 is a^^A for 
some a E Qp[x] whose norm agrees with that of A. This shows that the 



LINNIK'S ERGODIC METHOD AND POINTS ON SPHERES 



21 



orbits of on ^(Q''^) are naturally identified with Q^/Nr(Qp[x]''). This 
quotient is a group of order 2. 

Now suppose that B = M2{Qp), so that we can take = GL2(Zp) and 
SL2(Zp). 

Let X = ^ be an element of (so that 6^ + ac = —d) and 



1 

-d 



c -b 
then 



A..,:- 



' b 1' 




"a " 


c 




-b 1 



and the elements of Nr(A2;_!>y) are those elements of Zp represented by the 
quadratic form Q = aX'^ -\-2bXY — cY'^ , which has discriminant —4d. Thus, 
X and y are in the same orbit of i^^^^ if and only if Q represents 1 over Zp. 

For all facts used below about isomorphism classes of binary quadratic 
forms over Zp, see JonSOl §31]. 

First, suppose p is odd. If p does not divide d, then Q is equivalent 
to + dV'^, and in particular represents 1. So in this case, ff^^^ acts 
transitively on If p divides d, then Q is equivalent to either X'^ + dY'^ 

or eX"^ + e'^dV"^, where e S Zp is a nonsquare. In the former case, y is in 
the orbit of x; in the latter case, 

e 

-e-^d 

is in the orbit of x. So there are two orbits, as claimed. In both cases, Q 



represents an element of Zp , 



so ff^ acts transitively on 



Now take p = 2. In this case, there are always exactly two equivalence 
classes of binary forms of discriminant 4(i, one of which represents 1 over Z2 
and the other of which does not. The non-representing forms are: 

• 2X2 + 2Xy + (l/2)(d+l)y2 ((i = 3mod4) 

• eX2 + e-idy2 ((i= 1,2 mod 4) 

where, in the latter case, e G Z2 is an element which is not a norm from 
Q2(\/— (i)^. In case d = 1 or 2 mod 4, we again see that either y or y' = 
e 
e-^d 

And again Q represents an element of Zg , so some element of 
to y. 

In case d = 3 mod 4, take 

1 -2 
(l/2)(d+l) -1 



lies in the orbit of x, so there are two orbits of on 

sends x 



y 



A direct computation shows that the orbit of x under i^^^-* contains either 
y or y', so again there are two orbits. In this case, the two orbits are not 
interchanged by . □ 
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3.8. Proof of Proposition |3.5|, A. We define a map 



^ '■ ^x^y ^y^z ~^ ^x^z 

by sending X ^ to Xfi. Evidently it preserves tlie natural structures of 
{^x, ^2)-bimodule on the two sides. We will show that the image of m 
- call it A' - coincides with ijAx^z, for some r] G ff^- This will imply 
our conclusion, because "multiplication by r/~^" is also an isomorphism of 
i^x, ^z)-bimodules. 

Clearly A' C A^..^.^ . The Zp-module Ax^y ®z Zp is explicitly described by 



Proposition 3.7 If p is odd, it follows from from the second case of Proposi- 
tion [3?7 that Ax^y, Ax^zi Ay^z all include elements of B(Zp) = B(Z) (^z "Lp 



whose norm lies in . It follows that A' also contains such an element. If 
p = 2, the same conclusion holds unless d = 2> mod 4; if d = 3 mod 4, we 
may conclude only that Ax^y contains an element whose norm has valuation 
^ 1. 

Both A' and Ax^z are free rank 1 modules under i?x\ thus, for every 
there exists rjp £ i^x Zp such that A' (g) Zp = r]p(Ax^z "SD Zp). In order 
that both localizations contain an element whose norm belongs to Z^ , it is 
necessary that rjp belong to (^2;(8)Zp)^ ; this implies that A'(8)Zp = Ax-^z'^'^p- 

Thus, unless d = 3 modulo 8, we indeed have an equality A' = Ax^z- 
In the case where d = 3 modulo 8, we may still draw the conclusion that 
there exists an ideal / in ^j^, divisible only by the prime above 2, so that 
A' = I.Ax^z- This implies the stated conclusion, since the prime of Q(\/— d) 
above 2 is principal when d = 3 modulo 8. 



3.9. Proof of Proposition 3.5 , B. Let us recall first of all the following: 
for x,y G J^d, 

X G S03(Z)^y <;=^> uxu^^ = y, some u G B(Z),Nr('u) = 1. 
X G S03(Z)y uxu~^ = y, some u G B(Z), Nr(M) G {1, 2}. 

These facts may be verified by direct computation. 

Notice that if Ax^y is trivial there exists a global generator A for Ax^y 
as an ^^^-module. By the conclusions of the prior paragraph |3.8[ the norm 
of A is a p-adic unit for all p > 2. 

We separate two cases, as in the prior proof. 

Case 1: d = 1, 2(4). Suppose that Ax^y is trivial. In this case, the norm 
of A is also a unit aX p = 2. Thus, there exists a unit u G B(Z)^ such that 
uxu~^ = y. 

Conversely, if there exists a unit u with uxu^^ = y, then u lies in Ax^y 
The span u.i?x '^s a ^a;-submodule of Ax^y, and coincides with it by the same 
reasoning as used in ^3.8[ Therefore, Ax-^y is principal as an ^^^-module. 

Case 2: d = 3(8). if Ax^y is trivial. In this case, the 2- valuation of the 
norm of A is at most 1. Therefore, y = AxA~^ with A an element of B(Z) of 
norm 1 or 2. 
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Conversely, suppose that x,y G J^d are related by an element of S03(Z). 
Equivalently, there exists A G A^-^y with Nr(A) G {1,2}. If Nr(A) = 1, 
then [A^j^j^] is principal, exactly as above. Suppose Nr(A) = 2. Reasoning 
as before, X.^x is an ^^^-submodule of A^^y of index ^ 2; since the ideal 
above 2 in is principal, we deduce again that [A^j-^y] is trivial. 



3.10. Proof of Proposition [3^ C. Write for ^k^^z^- Given a E i^k 
everywhere locally invertible, one defines the ^ii--ideal (a) = ai?K H ^k- 
Choose a such that (a) is in the class oi g £ Pic(^x)- 

Choosing an x G J^d, and thus an embedding t^; of ~ Q[x] in B; using 
Lx to identify a with an element of Q[x] ®%7L we consider the left B(Z)-ideal 



B(Z) (a) and argue as in ^3.3; this ideal is principal (as B(Z) is euclidean, 
^3^ so there exists q e B^(Q) so that B(Z)g"i = B(Z)(a). In particular, 

we have q = oT^u with u G B(Z) . 

Now let y = q~^xq = u~^axa~^u = u~^xu; then y lies in B(Z) and in 
B(Q), so y G JTrf. 
Now 

(3.4) Ax^y = Q[x]q n B(Z) ^ Q[x] D B{Z)q-^ = Q[x] D B(Z)(a) = (a). 
So [A^.^^^] = g, which proves assertion (3). 



3.11. Proof of Proposition 3.5, D. Notation as in the statement of 
Proposition |3.5p . The proof of 3.5 B shows that [A^^-j^j,] is a prime ideal 
above 2. It is not principal since x ^ y. Since d = 1, 2 modulo 4, the prime 
2 is ramified in Q{\/—d), and therefore this uniquely specifies [Aa,^^]. 



3.12. The Pic(^x)-action explicated. The arguments of ^3.10 enables us 



to describe the action of Pic(^ii:) on Jif£ or J^d quite explicitly. Let p be 
an id eal of norm 5 in we now verify that the algorithm prescribed in 
is indeed a lifting of the action of [p] from J^f£ to J^d- 



^2.4 



Let x G Jifdj thus giving an embedding of Lx of K into B. Let a be an 
element of which projects to a unit in when v is prime to p, and 
to a uniformizer in ^Xp- Choose q G B^(Q) such that B(Z)g-^ = B(Z)(a); 
then g has norm 5^^. 

Se1rl£2 = q{l + i)^^, which has norm 1/10. Taking y = q2^xq2, we have, 
as in ( |3.4[ ) , 

Ax^y^Q[x]nB{Z)il + i){a) 

The right-hand side is an ^^^-submodule of p of index 2. If d is 1 or 2 mod 
4, this means that Ax^y is in the class of qp, where q is the ideal of i?x lying 
over 2. If d is 3 mod 4, we have 

Q[x] n B(Z)(1 + i){a) = Q[x] n B(Z)(2a) ^ Q[x] n B(Z)(a) 



^The motivation for introducing this may become clearer later; the lifting of the action 
from JifX to Jifd involves imposing additional congruences modulo 3, and q2 will have 
better mod 3 properties. 
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SO the class of Ax-^y is just that of p. We note, however, that the action of 
p and pq on is the same, since the action of [q] permutes the fibers of 

The integral quaternions of norm 10 can all be expressed in the form ru, 
where u £ B(Z)^ and r is an element of 

(3.5) A5 = {l±3i,l±3j,l±3k}. 

It follows that, for each x in J^d, the class [p]x in must be represented 
by r~^xr for some r G ^5. Now a direct computation shows that the action 
of conjugation by the six elements of ^5 yields precisely the action of the 



six matrices appearing in { 2.6 For example, conjugation by 1 — 3i acts on 
B^*^) via the rule 

i I— )• 

J ^ + 3i)i(l - 3i) = -\j + \k, 
ID 5 5 

^{l + 3i)k{l-3i) = -\j-\k 
10 5 5 



which corresponds to the matrix A in { 2.4 



Representations of binary quadratic forms by + + 



We now discuss the proof of Proposition 2.11[ "Linnik's basic lemma." 



We shall, in fact, discuss two proofs; the first ( ^4.1[ ) is simply quoting a 
result of G. Pall, which in turn rests on Siegel's mass formula; the second, 
presented in §4.3| after a preliminary discussion, is based on ideas related to 

m 

4.1. A result of Gordon Pall. Let (Q, Z™-), (i?, Z") be two non-degenerate 
integral quadratic forms with m ^ n. One says that Q represents R if 
there exists a Z- linear map t : — )• Z™" such that, for any x G Z", 
Q(t(x)) = i2(x). It follows that l is an embedding. 

In particular, given xi,X2 G J^d with X1.X2 = e, the linear map 

t : Z^ ^ Zxi + Zx2 

defines a representation of the binary quadratic form R{x, y) = dx^ + 2exy + 
dy^ by the ternary form Q{x, y, z) = + + z^. 

Therefore, counting the number of pairs (xi,X2) G Jif^ with X1.X2 = e 
is essentially equivalent (up to the action of the finite group S03(Z)) to 
counting the number of representations of R by Q. 

The precise computation of the number of embeddings, r(a, b, c) say, of a 
binary quadratic form ax^ + bxy + cy^ into the ternary quadratic + + z'^ 
in the most general case (no primitivity assumptions, etc.) was first carried 



out by Pall Pal49, Theorem 4, page 359]. His theorem gives a formula 
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(4.1) r(a, 6, c) = 24 • 2^" • J| rp{a,b,c) 

p|2(fe2_4ac) 

where it follows from Pall's result that: 

(1) u is the number of distinct odd primes dividing the discriminant 
6^ - 4ac; 

(2) rp{a, b, c) is bounded by an absolute constant unless p^|(a, b, c). 
In particular, it follows that 

(4.2) \r{a,b,c)\ <.max{a,b,cy, 
when {a,b,c) has no square factor. 



One way to obtain a quantitative result like (4.2 ) is by use of Siegel's mass 
formula, combined with a computation of local densities. We will sketch in 
the rest of this section an alternate approach. We must first revisit the 
material of ^js] and describe a "different" (although closely related, as we 
shall see) connection between J^d and the class group. 

4.2. The orthogonal complement construction. We have seen that 
J^* can be placed in bijection with the group Pic(^i^); but there is no 
natural group structure on for there is no natural choice of an iden- 
tity element of ■ In other words, the torsor structure is natural but 
admits no natural trivialization. However, the orthogonal complement con- 
struction, which we shall now explain, gives a trivialization of the "square" 

Let Q-d be the set of binary quadratic forms aX^ + bXY + cY'^ of dis- 
criminant 



6^ - 4ac = disc(^x) 



-d, d = 3 mod 4 
-M, d=l,2 mod 4 



considered up to SL2(Z)-equivalence. Then Q-d parametrizes rank-2 qua- 
dratic lattices A of discriminant disc(^/^) G {— d, — 4d}, endowed with an 
orientation A^A = Z. 

There is a natural map — Q-d sending x £ J^d to the induced 
quadratic form on x-^ (the rank 2 lattice in of vectors orthogonal to 
x); here we understand that, if d = 3 modulo 8, then we scale the resulting 
quadratic form by ^, and that a basis (A, A') of X"*- is oriented if AAA' Ax > 0. 

Composing with the classical identification of Q-d with Pic{i^K) now 
gives a map 

(4.3) Perp : ^* ^ Pic(^i^) 



^The situation is similar to one familiar from geometry: If X is a smooth cubic plane 
curve over a field k with Jacobian E, then the points X{k) form a torsor for the group 
E(k). This torsor has no canonical trivialization, but the embedding of X into the plane 
gives a canonical trivialization of the "cube" X x e X x e X . 
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which can be described concretely as Perp(x) = [A^-s^s]. Both sides admit 
an action of Pic(^x); the left-hand side by means of the torsor structure 



described in section § 3.1 and the right-hand side by multiplication. But 



Perp is not equivariant for this action; rather, it intertwines the action of 
Pic(^i^) on the left with the square of this action on the right, as we will 
now see. Suppose given x,y in Jif^ such that [A^_5>j/] = a £ Pic(^/^). One 
checks that [Aj,_j.2,'] = [A^ Thus 

Perp (7/) = [Ay^y] = [Ay^^][Ax^s][Ax^y] = a^ Perp (x). 

It follows that the image of Perp is precisely one coset of 2Pic(i^;^). 
Moreover, the cardinality of a fiber of Perp is just the order of the 2-torsion 
subgroup of Pic(i^ii-). 

What Perp supplies is not a trivialization of the torsor , but a trivial- 
ization of its square in the group of Pic((^i<-)-torsors. The square is a torsor 
T which can be described explicitly as the set of equivalence classes of pairs 
(x, y) under the relation (x, y) = {ax, a~^y) for all a G Pic(^ii-). The action 
of a G Pic(^/f) sends to {ax,y) (or to {x,ay), which is the same.) 

Then the map sending (x, y) to [Aj^_!>g] is a canonical isomorphism between 
T and Pic(^A-). 

4.3. Bounds for representations, revisited. We now sketch how the 
"orthogonal complement construction" leads us to another proof of Propo- 
sition ETB 

For simplicity, we consider only the case where 6^ — 4ac is of the form 
—4d, with d squarefree. Given an embedding 

L : {I?, ax^ + bxy + cy^) ^ (Z^ x"^ + y"^ + z^). 

consider the orthocomplement inside 1? of It is generated by a single 

vector X G unique up to sign. The quadratic lattice x"*" is plainly the 
same as its sublattice i(Z^) after tensoring with Q; since d is squarefree, the 
two lattices are in fact identical. This implies that x.x = d. 

The embedding i is determined up to at most 6 possibilities (6 being the 
maximal number of automorphisms of a positive definite form in rank two) 
by X. It suffices, therefore, to count the number of x G J^d so that the 
quadratic form induced on x"*" is isomorphic to (Z^,ax^ + hxy + cy^). By 



^4.2, this is at most 24| Pic(^ii-)[2]|, where [2] denotes 2-torsion. By genus 



theory we have | Pic(^i^)[2]| ^ d^, and the bound (4.2) follows. 

Part 3. Adelization 

In this part, we interpret, in adelic terms, the various classical arithmetic 
sets and structures discussed so far -i.e. Md, •^d-, Pic(i^K), J^diq), -^dio) 



It is irresistible to ask which coset. A quadratic form Ax^ + Bxy + Cy^ of discriminant 
d embeds, over Q, into 1? with the standard quadratic form only if {A, d/A)p = (d, d)p 
for every p dividing 2d; here (a, b)p is the Hilbert symbol. This condition in fact defines a 
coset of squares, and so describes exactly the image of Perp. 
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- and the various maps between them. This interpretation gives us an 
alternative approach to the results described in Part[2]and more importantly, 
will be key to the proof of Proposition 2.13 We refer to Kna97 for a recent 



gentle introduction to the adelic theory of algebraic groups (in relation with 



automorphic forms), and to BM66,PR94 for more extensive treatments. 
We denote by the ring of finite adeles of Q, i.e. the restricted product 



of 



cpjp prime 



with respect to the sequence of maximal compact subgroups 



(Zp)p prime, and by A = Mx the ring of adeles. We denote by Z = Yl^ Zp C 

Af the maximal compact subgroup of Aj-; alternatively Z is the closure of Z 
in Aj. Given G a Q- algebraic group with a chosen model over Z, we denote 
by G{Af), G(A), G(Z) the groups of points of G in the corresponding rings. 

We begin by summarizing the contents of this part (^- Q via a com- 
mutative diagram: 

We set (see ^5.2 below for the definition of genusso3(^^)) 



V 



{(L,x), L G genusgo3(Z'^), x € L, x.x = d}. 

{(L,x), L E genusgo3(Z'^), ^GL/qL, x.x = (i(modg)}. 



For the sequel, we need to fix base points xq G J^d and x^ G J^(g); this 
being done, we define 



(4.4) 
(4.5) 



Kf[q] 
K'fiq] 



ker(S03(Z) ^ S03(Z/gZ)) 
{g G S03(Z) : g.^g = x^} 



and denote by SOxq the stabilizer of xq in SO3. We will show in the fol- 
lowing sections how to describe J^d and J^dil) adelically, by means of a 
commutative diagram 



(4.6) 



redo 



^d{q) 



-SO3 

red, 
SO3 



SOxo(Q)\SOxo(A/)/SOxo(Z) 



S03(Q)\S03(A/)/K'[q]. 



We will also explain how to describe M'd and J^diQ) adelically, which is 
slightly more involved technically (though no different conceptually.) 
For this we define 



Q = {(L,x,6'), L G genusgogl 
(3,g) := {(-^,x,6'), L G genusgogi 



^), X G L, yL.x = d,e : L/3L ~ (Z/SZ)^}. 

9 : L/3L ~ (Z/3Z)3}, 



{{L,x,9), L G genussoo(Z^), xeL/qL, x.x = d(modg). 



and, for any integer a, 

(4.7) Kf[a,q]:=Kf[a]nK}[q]c K'fiq]. 
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Then we will construct a diagram 
(4.8) 

■^d ^ S03(Q)\Q SOxo(Q)\SOx„(A/) X S03(Z/3Z)/SOxo(Z) 



redo 



redo 



=^d(g)^S03(Q)\P(3,,) 



S03(Q)\P(,) 



S03(Q)\S03(A/)/i^/[3,(7] 



I 

S03(Q)\S03(A^)/i^U(?]; 



where the vertical arrows at the bottom are the evident surjective maps. 
The horizontal arrows of these diagrams are described in ^ and ^ while 
the vertical arrows are discussed in ^ 

4.4. Convention: orthogonal groups vs. unit group of quaternions. 



We have already noted in ^3.1 that the quadratic spaces 



+ + 



(B^'^), Nr) are isometric and that the action -by conjugation- of the units of 
Hamilton quaternions, B^, on B^*^^ induces and isomorphism of Q-algebraic 
groups 

PB^ ~ SO3 

with PB^ = Z(B^)\B^ the projective group of units. In particular the 
group of adelic points PB^(A) and S03(A) get naturally identified as are 
their various respective subgroups. In the sequel, we shall freely use this 
identification, moreover we will use the same notations for various subgroups 
Kf\q\, K'^\q\ etc. to denote either some subgroup in S03(A) or its image in 
PB^ under the above isomorphism. 

5. Adelic interpretation of J^diq)- 
In this section we identify J^dil) (as well as the sphere 5^), with an adelic 



quotient of SO3, verifying the second lines of (4.6) and (4.8). 



5.1. Adelic actions on rational lattices. The adelic group GL3(Aj) acts 
on the space of lattices in Q'^ as follows. If L C is a lattice, and g = Qp 
is an element of GL3(Aj), we define 

g.L = {a G q3 : ^^(a) G Qp.Lp for all p}. 



where Lp : 
stabilizer of Ln 



)p is the inclusion. This action is transitive, and the 
under this action is GL^{'L). 



5.2. The SOs-genus of the lattice Lq is, by definition, the orbit of Lq under 
the action of S03(Aj) C GL3(Aj): in other words, this is the set of all 
rational lattices which are everywhere locally isometric to Lq: 

genusso3(^o) = S03(A/).Lo ~ S03(A/)/S03(Z). 
The set of genus classes of Lq is the set of S03(Q)-orbits in genusgo3 (-^>o)- 
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Proposition. There is only one SOs-genus class - that is, the quadratic 
form + + has genus one. Equivalently, 

(5.1) S03(A/) = S03(Q)S03(Z). 

Proof. Let L G geims^Q^^A f).LQ, in particular the covolume of L equals 
the covolume of Lq which is one. We identify Lq with the traceless integral 
quaternions B*^*^) (Z) and SO3 with PB^ ; in these terms, we need to show that 
there is g G B^(Q) such that qLq^^ = B*^^)(Z). By definition, there is qf E 
B^(A/) such that L = qfB^^\Z)qJ^ and so ^ := Z + L = qf{Z + B^^\Z))qj'^ 
is a lattice in B(Q) containing the identity and stable by multiplication, i.e. 
an order; hence, by ^3.2| there is q £ B^(Q) such that qGq~^ C B(Z) and 
so qLq~^ C B^^\Z); since qLq-^ and B(°)(Z) have the same covolume they 
are equal. □ 
If we replace + + z'^ by a different ternary quadratic form Q, this 
will in general not be so. 

5.3. Level structure. By construction, the quadratic form + + 
takes integral values on any lattice in genusgo3(-^^o). In particular, given q ^ 
1 an integer, the quotient lattice L/qL (Z/qZ)'^ is naturally a quadratic 
space for the form x'^ + y'^ + z"^ . A q-level structure on such a lattice is an 
additional datum related to L/qL. Here, we will consider two type of level 
g-structures: 

- the principal g-structure: this is the datum of an isomorphism of 
quadratic spaces 9 : L/qL ~ (Z/gZ)^. We will use this only for 
q = 3 and mainly for cosmetic purposes. 

- a weak g-structure: this is the datum of a point x G L/qL such that 
x.x = d(modg) when {d,q) = 1. This will be the main structure 
considered in the present paper. 

Related to these level structures, are the open compact subgroups of 
S03(Z), 

Kf[q], K'fiq], Kf[3,q] 



defined by (4.4), (4.5), ( |4.7| ) relative to the choice of some base point Xg E 

5.4. M'dio) as an adelic quotient. Let us consider first the set of pairs 

^(<?) := L e genussogl^o), x G L/qL, x.x = d(modg)}. 

The group S03(Q) acts diagonally on 'P(q). From (5.1) any S03(Q)-orbit in 
^(g) contains a pair of the form (Lo,x) with x G Md{q). Moreover if two 
pairs (Lo,x) and (Lo,x') give rise to the same S03(Q)-orbit then x and x' 
differ by an element of S03(Q) n S03(Z) = S03(Z). It follows that the map 
X G J^diq) ^ {Lq,x) G 'P(g) induces a bijection: 

S03(z)\jrd(g) ^ S03(Q)\P(g). 

In the sequel, we will denote by [L,x] the S03(Q)-orbit of the pair (L,x). 
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In fact, since L/qL ~ {L ^z'^)/q{L ^z'^), the whole group SO^i^f) acts 
diagonally on 'P(g). 

Lemma. This action is transitive: fixing Xg G M'(i[q), we have 

= S03(Aj).(Lo,Xg). 

Proof. Indeed any (L, x) G 'P{q) is in the orbit of a pair of the form (Lq, x') for 
some x' G It follows from the Lemma below that S03(Z) (through its 

projection to S03(Z/gZ)) acts transitively on J^(g). Taking fcx' G S03(Z) 
such that fc^/Xg = x', we have fc_/(Lo,x') = (Lo,Xg). □ 

Lemma 5.4.1. For any prime p, the group S03(Zp) acts transitively on 
J^di'^p) (defined in the vident way). Consequently, S03(Z) acts transitively 
on ^(Z). 

Proof. This - which can be thought of as an analogue of Witt's theorem for 



rank 3 lattices over Zp - is immediate from Proposition 3.7, using the first 
case if p = 2 and the second case if p is odd. □ 
We thus have 'P(q) ~ S03{Af) / K'j[q]. From the above discussion, we de- 
duce that 

(5.2) 5^,(g) ^ S03(Q)\P(g) ^ S03(Q)\S03(A/)/ir}[(?]. 

5.4.2. Lifting to JifdiQ)- This adelic realization of J^dio) may, in fact, be 
lifted to an adelic realization of Md{q) itself, at least when q is coprime 
with 3. For this we add the additional data of the principal level 3-structure 



discussed in [ 5.3 This is a little bit artificial, relying on the special fact that 
the reduction modulo 3 maps yields an isomorphism S03(Z) ~ S03(Z/3Z). 
Consider the set of triples 

{(L,x,0), L G genusso3(-i^o), ^£L/qL, x.x = 6 : L/3L ^ {Z/3Zf}. 

As above, S03(Aj) (hence its subgroup S03(Q)) acts diagonally on "^(3,^): 
explicitly for g G S03(Aj), we have 

g.{L,5l,e) = {g.L,g.^,9 o g-^). 



We consider first the S03(Q)-orbits in 'P(3.g)- From (5.1) any S03(Q)-orbit 
contains a triple of the form {Lq,x.,6). Moreover, since reduction modulo 
3 is an isomorphism S03(Z) ~ S03(Z/3Z), the map x 1— )• (x. Id) yields a 
bijection between J^dig) and the set of S03(Z)-orbits of pairs {(x, 0), x G 
J^diq), : (Z/3Z)3 ~ (Z/3Z)3}. From this, we deduce that the map x G 
Jifdiq) ^ {Lo,x.,ld) G 'P{3,q) induces a bijection: 

=^d(g)^S03(Q)\P(3,g). 

Returning to the action of the whole group S03(Aj), we have 
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Lemma. This action is transitive: fixing Xg G M'dio), we have 

7^(3,,) = S03(A/).(Lo,Xg,Id). 

Proof. The proof is exactly as above: any triple is in the orbit of a triple of 
the form (Lo,x,6l), x G ^(9), : {Z/3Zf ~ (Z/SZ)^. This follows from 
the fact that np|gS03(Zp) x S03(Z3) acts transitively on the set of pairs 
(x, 6); recall (^2.1[) that we are assuming that q is prime to 3. □ 



We have "^(3,5) — S03{Af)/Kf[3,q]. (Recall the notation from ^5.3). 
From this and the above discussion, it follows that the map 

xGjrd(g)^(Lo,x,Id)GP(3,g) 

induces a bijective map: 

(5.3) Jifdiq) ^ S03(Q)\P(3,,) ^ S03(Q)\S03(A/)/K/[3, q]. 

5.5. The sphere as an adelic quotient. For completeness, we recall the 
interpretation of sphere 5*^ as an adelic quotient; this will not be used there 



but it is the way to proceed in order to adapt the proof of Theorems 1.3 



and 1.4 to obtain Theorems [LT] and 1.2 or to obtain hybrid equidistribution 



theorems. Let A = M x Aj denote the full ring of adeles, then S03(A) 
S03(M) X S03(Aj); hence, by ( |5.1[ ) we have the identification 



S03(Z)\S03(M) ^ S03(Q)\S03(A)/S03(Z). 

Since S03(M) acts transitively on S'^, we obtain - choosing some point Xqo £ 
5^ and letting 

i^oo = S0,^(M)~S02(M) 

- the identification 

(5.4) S03(Z)\52 ~ S03(Q)\S03(A)/i^oo.S03(Z). 

As in the previous section, we may remove the quotient by S03(Z) by adding 
the principal 3-structure and obtain 

S03(M) S03(Q)\S03(A)/K/[3]. 

(5.5) 52 ~ S03(Q)\S03(A)/i^oo.i^/[3]. 
6. Adelic interpretation of J^d- 



In this section, we describe the first line of the diagrams (4.6) and (4.8). 

The significance of this in the proof is as follows: We previously described 
an action of Pic(^x) on and also a lifting of this action - at least in 
the case of a prime ideal above 5 - to J^. After the present section, we will 
understand this action (and also the lifted action, although this is only of 
cosmetic interest) in terms of adeles. This will be a key tool in the proof of 
Proposition |2. lO] and Proposition 2.13 



The presentation of this section follows that in EV08 , but we emphasize 
that the material is in essence classical. 
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6.1. As before, let V be the set of pairs 

V = {{L,yi), x.x = d, X G L, L € genusgOg (i>o)}- 

By contrast with 7^(g) or "^(3,5), the set V carries no natural action of 
S03(Aj); on the other hand, the group S03(Q) acts on V diagonally. By 



(5.1), every S03(Q)-orbit in V contains an element of the form (Lo,x) 
(where x belongs to ^d)] two such pairs differ by the action of a unique 
element of S03(Q) n S03(Z) = S03(Z). From this is follows that the map 
X G J^d I—)- (-Loix) G V induces a bijective map 

(6.1) M'd ^ S03{Q)\V. 

We will now realize S03{Q)\V as an adelic quotient by endowing it with 
a transitive action of a subgroup of S03(Aj). 

Choose an element xq G J^d, and let SOxq be its stabilizer in SO3. Since 
the quadratic space is 3-dimensional, SOxq is the special orthogonal group 
of a quadratic plane (namely Qxq ), and so is a 1-dimensional torus. 

The group SOxq(Aj) acts (by multiplication on the first coordinate) on 
the subset of V consisting of pairs of the form (L,xo). We can extend 
this action to the whole of S03(Q)\T' as follows. By Witt's theorem, every 
S03(Q)-orbit in V is of the form [L,xo], for some L £ S03(Aj)Lo. Then or 
any t £ SOxo(Aj), we define 

t.[L,xo] := [t.L,xo]. 

This is well defined: if [i,xo] = [L',xo] then L and L' differ by an element 
r G SOa.Q(Q); because of the commutativity of SOxq, 

[frL,xo] = [TtL,xo] = [tL,xo]. 

This action is transitive: consider the orbit of (L,xo) where L = g.L p and 



5.4.1 



g G S03(Aj). Since L contains xq, Lq^Aj contains g ^'Kq; by Lemma 
there is an element k of S03(Z) which sends xq to g^^:x.Q. In particular, 
gk = t lies in SOxo(A/) and t.[Lo,xo] = [gkLo,xo] = [L,xo]. 

Finally, from the definition of the action it is easy to see that the stabilizer 
of [Lo,xo] is the subgroup SOxo(Q)SOxo(Z) where SOxo(Z) = SOxo(A/) n 



S03(Z). Hence (6.1) extends to bijections 



(6.2) S03(Z)\jrrf ^ S03(Q)\P ^ SOxo(Q)\SOxo(A;)/SOxo(Z). 

Remark. This setup is very specific to the three dimensional situation, in 
particular the fact that SO^q is commutative. If one studies the question of 
the representations of an integer d by a higher rank quadratic form Q, the 
quotient SOq(Z)\c^, when non-empty, still has a description in terms of 
double cosets of an adelic group (see [EV08j for details) but will not carry 
an action of an adelic group as it does here. 
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6.2. We shall now identify SOxo(Q)\SOxo(A/)/SOxo(Z) with a quotient of 



the commutative group Pic(^x)- Therefore, the identification (6.2) may be 
considered as giving an action of Pic{ffK) on J^^- 

For this purpose, it is again most convenient to phrase everything in 
terms of quaternions via the identifications ~ B(°)(Q) and SO3 ~ PB'^. 
In particular we view xq as a trace-free quaternion. Now the stabilizer 
SOxo — PB^Q is the multiplicative group generated by (conjugations by) 
invertible quaternions of the form a + 6x0 . In the previous section, we have 
defined a transitive action of SOxo(A/) ~ PB^q(Aj) on J^^- 

Now, the map a+b\/'^^ 1— )• a+6xo defines via conjugation an isomorphism 
of Q- algebraic groups 



(6.3) Res^/QG^/G^ ~ PB 



X . 



in particular, for any field F D Q, PB^jj(F) ~ (F (g)Q KY/F 



Thus, denoting by = {Kf®K)^ the group of finite ideles of K, (6.3) 



defines an action of A^^ on J^d- The subgroups A^ and act trivially. 



as does U C A^^, the preimage of PBx(,(Z) under (6.3). The maximal 



compact subgroup = {^k ®z ^)^ of A^^ is certainly contained in U; 
therefore, we have a well defined transitive action of 



on J^d- This action being transitive, is identified with a quotient of 
Pic(^i^), and is in fact a torsor under the abelian group K^\K^ JU . A 

careful study of the inclusion <Z U aX 2 recovers Proposition |3.5 

A priori, the action we have defined depends on xq. We verify indepen- 



dence in ^6.4 



6.3. Lifting the action to As before, by introducing an extra level 
3-structure, we may replace by its covering and obtain a lift of the 
action of SOxo(Aj) ~ PBx„(Ay) and thus of A^^ on to an action on 
J^. Notice that this latter action is, in general, not necessarily transitive; 
nor does it, necessarily, factor through the class group, but only through a 
certain ray class group. 
Consider the set of triples 

Q = {(L,x,0), L e genusso3(Fo), x G L, x.x = d, : L/3L ~ {Z/ZZf}. 

Using that the reduction mod 3 map from S03(Z) to S03(Z/3Z) is an iso- 
morphism, we may verify as above that the map 

^d^S03(Q)\Q, x^[Lo,x,Id] 

is bijective. 



34 J. S. ELLENBERG, PH. MICHEL AND A. VENKATESH 

As above, the group SOxo(A/) acts on S03(Q)\Q: by Witt's Theorem, 
every S03(Q)-orbit in Q is of the form [L, xq, 9] and we set for t G SOxo(A/), 

t.[L,xo,0] = [t.L,xo,0ot-i]; 

from the previous discussion, this action is well defined. This action, how- 
ever, is not transitive: the various SOxo(Aj)-orbits are parametrized by the 
orbits of S03(Z/3Z) under the action of SOxo(Z). In other words, we have 
a bijection 

(6.4) M'd ^ S03(Q)\Q ^ SOx„(Q)\SOx„(Aj)x S03(Z/3Z)/SOxo(Z), 

where SOxq(Z) acts diagonally on the product SOxo(A/) x S03(Z/3Z). Un- 
der this identification, the action of t G SOxo(A/) is the one induced by 

t.[t',K] = [tt',K\, {t',K) G SOxo(A/)xS03(Z/3Z). 



Thus (6.4) gives us the desired way to lift the action of f 



6.4. Independence w.r.t. xq. Let us see that the above defined actions 
of on J^d, do not depend on the choice of the base point xq. 

Let Xq G be another point. By Witt's theorem Xq = p.xg, for some 
p G PB'<(Q). Then PB^^ = pPB^^^p^^ Let u = a be an element 

of A^ (a, 6 G Aj), and let (resp. t^) denote the corresponding element in 
PB^^(A/) (resp. in PB^, (A/)). Then t'^ = pt^p'^. It wiU suffice to see that 

t„.xo = t'u.xo, or equivalently 

t„[Lo,xo,Id] = 4[Lo,xo,Id]; 
the latter expression equals 

4[^o,P"^xo,Id] = t'„[pLo,xo,p"^] = [t^pLcXo,/?"^'^^] 

= [t„Lo,xo,t~^] = tu[Lo,yLo,ld]. 

□ 



7. AdELIC INTERPRETATION OF redo : J^d 



and redoo in terms 



In this section, we interpret the reduction maps, redg 
of the adelic quotients from the previous sections. 

Recah that, in ([S^, ^?2\, (|631 ) (resp. ([5l5| ), ( [53| ), we have given 

adelic identifications of S"^, 
S\ M'M and Jfd- 



J^d{<l) and J^d, and of their finite coverings 
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7.1. First, let us recall the identifications 

^ S03(Q)\P, m{q) ^ S03(Q)\P(q) 

where 

S03(Q)\P = {[L,x],LG S03(A/).Lo, xeL, x.x = d} 
S03(Q)\P(,) ={[L,x], LG S03(A/).Lo, 5ieL/qL, x.x = %)}. 
The map redg : M'd — )■ M'dio) is induced by the natural map 

X e L i-> X G L/gL 

which we also denote redg. 

We now explain how to write redg in the adelic language. Let t be an 
element of SOxo(A/) and [t] its double coset in SOxo(Q)\SOxo(A/)/SOxo(^). 

Recall that xq and Xg were basepoints in Md and J^dil) respectively. We 
will demonstrate that the reduction map redg, thought of as a map 

red, : SOxo(Q)\SOxo(A/)/SOxo(Z) ^ S03(Q)\S03(A/)/i^/[(/] 

is the one sending [t] to [t./cx^], where k^^ G S03(Z) is a fixed element 
satisfying k^g.Xg = xo(modg). 

To see this, let t £ SOxo(Aj) be a representative for one of the double 

cosets in SOxo(Q)\SOxo(Aj)/SOxo(^)- We may choose t to have integral 
coordinates at all primes dividing q. Write (3 = jk, with 7 G S03(Q) and 



k G S03(Z). Then by the definitions of (6.2) and (5.3), one finds that the 
element of S03(Z)\J^ corresponding to t is 7~^xo, while the element of 
S03(Z)\J^((7) corresponding to tk^f^ is /cxq. But ^k fixes xq (whence also 
Xq), so the reduction of 7^"'^xo is precisely /cxq. 

Again, we can lift the situation to J^: there is a natural map 

SOxo(Q)\SOxo(A/)xSO(Z/3Z)/SOxo(Z) ^ S03(Q)\S03(A^)/K/[3, g] 

which corresponds to the reduction map -^(9)- Explicitly, choosing 

for base points the triples (Lo)Xo,Id), (Lo,Xg,Id) the map is given by 

(7.1) [t,K]^[tk^,k3] 

where k^ G S03(Z3) is a lift of k G S03(Z/3Z) and k^^ is as above but such 
that its component at the place 3 is trivial. 

Part 4. Graphs and Expanders 

8. The graph structure on J^diq) 



We shall now replace the role of the finite adeles Aj in the bijection (5.3) 
by the much smaller ring Q5. More precisely, we will show the existence of 
a bijection: 

(8.1) J^diq) ^ r(3,,)\S03(Q5)/i^5, 
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where C S03(Q5) is a suitable lattice (i.e. a discrete cofinite subgroup) 
and is the maximal compact subgroup 803(25). We have 

S03(Q5) =^PB^(Q5):^PGL2(Q5) 

(since B(Q5) ~ M2(Q5)), therefore the quotient S03(Q5)/i^5 is identified 
with 

PGL2(Q5)/PGL2(Z5) =: Ts 
which has the structure of an infinite 6-valent tree (namely, the Bruhat- Tits 
tree of PGL2(Q5), cf. [SerOSj). 

The set J^dil) has thus a structure of a finite quotient of % and we 
will see that this graph structure coincides with that described in §2.9[ In 
particular, the latter is connected. 



From this viewpoint, it will be possible to prove Proposition 2.10 and 



Proposition 2.13 (i.e. the graph Md{q) is an expander). The latter relies 
on the theory of automorphic forms, especially the Jacquet-Langlands cor- 
respondence and the work of Eichler-Shimura. 

Let us mention that a good part of this section is closely related to the 



book of Lubotzky |Lub94 , especially Chapter 6 and the Appendix, in which 
the reader will find a motivated discussion of the passage between automor- 
phic forms and expanders. 

8.1. Mdil) as a quotient of a tree. If w is an element of PB^(Q5), we 
denote by \w\^ the element of PB^(Aj) which projects to w at the 5-adic 
place, and to the identity everywhere else. When no confusion is likely (as 
in the statement of the following lemma) we will identify PB^ (Q5) with the 
subgroup [PB><(Q5)]5 of PBX(A/). 

Lemma. One has 

PB>< (Q).PB>< (Q5).i^/[3, q] = PBX (Aj). 

Consequently, the map g G PB^(Q5) — )• [g\^ G PB^(Aj) yields a bijective 
map 

(8.2) r(3,g)\PB><(Q5)/i^5 ^ PB^(Q)\PB^(Aj)/i^^[3,(?] ^ JT^iq) 
where r(3^g) is the lattice PB^ (Q) n PB^ (Q5).A'/[3, g] . 

Proof. This main ingredient of the proof is the so-called strong approxi- 
mation property (for simply connected semi-simple algebraic groups): we 
will not discuss this property in any generality and refer to [PR94 Chap. 



7] for a complete treatment. Alternatively, the reader may also refer to 



[VigSO Chap. Ill ] for a discussion of the strong approximation property 
in the context of quaternion algebras. For now, let us merely say that, if 
PB^ satisfied the strong approximation property the assertion would follow 
immediately; unfortunately, PB^ is not simply connected. 

One remedies this problem by passing from the Q-algebraic group to 



its double cover B^ (cf. (3.2)). The group B^ is simply connected and 
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semisimple. Moreover, B"'^(Q5) is noncompact. Thus, it satisfies a strong 
approximation property: for any open compact Q C B^(Aj), we have 

It follows that PB'<(Q).PB'<(Q5).ii:/[3,g] contains the image 

e = B\Af)/{±l} 
of B^(A/) in PB''(A/). It wih suffice, then, to verify that 
(8.3) PBX(Q).e.K/[3,<?] =PBX(Aj); 



this will follow from if (G n PB^ (Z)).K/[3, q] = PB^ (Z); equivalently, 
if e n PB^ (Z) acts transitively on PB^ {Z)/Kf[3, q] ~ 

In turn, it is equivalent to show that Qp D PB^(Zp) acts transitively on 
J^(Zp) for each p|3(?, where Qp is the image of B^(Qp) in PB^(Qp). 

Recall that J^dC^p) is identified with B('^''^)(Zp), the trace quaternion 
of norm d e Ijp] then Qp PI PB^(Zp) contains B^(Zp)/{ibl} acting by con- 
jugation. The transitivity of this action follows from the second part of 



Proposition 3.7, using the fact that {p,2d) = 1. □ 



8.2. The graph structure. We can describe the (Bruhat-Tits) graph struc- 
ture on PB'' (Qs) /FB"" (Z5) in two equivalent ways. We write K5 = FB"" (Z5) 



in what follows. Recall that consists of a set of six matrices (as in ^2.6) 



or six quaternions (as in ^3.12) depending on whether we are working in 
SO3 or PB^. 

(1) Fix 05 S B^(Q5) such that the 5-adic valuation of its norm is ±1. 
Write 05 for its image in PB^ (Q5); then we have 

(8.4) K^.o^.K^ = □ H^Kr,. 

We join any coset gK^ to the six cosets g['w]5K5 : w G A5. The 
resulting structure is independent of 05 . 

(2) More intrinsically, we may identify the quotient S03(Q5)/S03(Z5) 
with the sublattices of Q5 in the orbit of S03(Q5).Z5. Given any 
such sublattice L, the quadratic form takes values on Z5 on that 
lattice and the induced quadratic form on L/5L = (Z/5Z)^ takes 
values in Z/5Z; there are precisely six (= |P^(Z/5Z)|) isotropic lines. 
Choosing any v G L so that the image of v in L/5L isotropic (but 
non-zero), we may construct the new lattice: 

L' = (v/5) -|- {w G L : v.w = mod 5.} 

Then L' depends only on the line spanned by v in L/5L, and belongs 
to S03(Q5).Z5 also. In particular, we construct six such L' , which 
we declare to be the neighbours of L. 
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Then these give equivalent graph structures and moreover: 

(8.5) The resulting graph is a tree, i.e., has no cycles. 

Later on we shall use the following property the verification of which we 
leave to the reader: if L is a sublattice of Q5 representing an element of 
S03(Q5)/S03(Z5), and v G L is such that v.v is not zero mod 5, then: 

(8.6) The orbit S0v(Q5).-^^ is an infinite geodesic in the tree, 

(i.e. an isometric embedding of 7L in the tree w.r.t. the obvious metrics); 
this is a simple consequence of the second definition. 

Let us now check that the neighbors of x G =^(q') for the graph structure 
defined in §2.9| correspond to the neighbors under T5 of the image of x in 



( 5.3 ). Thus, the graph structure on J^diq) coincides with the graph structure 



on r(3_q)\7i. 

In the notations of ^5.4.2, the neighbors of x i.e. {w.x, w G A^} corre- 
spond to the orbits of the triples 

{[Lo,w.x,ld], w £ A5} = {[w'^Lo,x,ldow], w £ A5}; 

now for any p / 5 and w G ^5, [w]p G Kp. Moreover, [w]3 = Id(mod3). 
Therefore the above set equals {[w^^]5.[Lo, x. Id], w £ A5} which manifestly 
agrees with the graph structure introduced above. 



8.3. Proof of Proposition 2.10. We focus now on the action of a prime 
ideal p above 5 on Jif^- Fix vr a uniformizer of Kp, which we write in the 
form a + G [\/—d]. To any x G J^d we associate the quaternion 

Qx = d + bx (the 5-adic valuation of Nr(gx) equals 1) and the corresponding 
rotation tx £ S0x(Q5) induced by conjugation by ^xj since Kp — vr^^^ 

we have S0x(Q5) = t^SO^iZ^). 

The action of p can be interpreted in terms of our adelic viewpoint: We 
have seen in ^6.3 that the group S0x(Q5) acts on with S0x(Z5) acting 
trivially; by projection this also defines an action on Jif^- Via the map 
vr I— )■ Qx ^ one obtains an action of the group tt^ which in fact does not 
depend on the choice of x ( ^6.4[ ); the action of vr coincides with the action 
of the ideal class [p] defined earlier. 

Let us also recall that if x G J^d corresponds to the class [Lo,x, Id] G 
S03(Q)\Q, the element vr.x G J^d corresponds to the class tx[-^0;X,Id] = 
[tx^O) X, Id o t~^] = [ix-^^0)X, Id], the final equality holding because the 3- 
component of tx is trivial. Therefore the trajectory vr^.x is described by the 
infinite sequence of lattices 

. . . , L^2, L-i, Lq, Li, L2, . . . , Li = f^L^. 

All the Li contain x. Write Lj^p for Li^p = Li 0^ Z^; then Li^p = L^^p for 
all i and all p ^ 5, the p-th. component of tx being trivial. So the sequence 
of lattices (Li) is completely determined by the sequence (Lj^s) of its 5-adic 
components. 
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Since the rotation comes from a quaternion whose norm has 5-adic 
valuation equal to 1, the sequence {Li^^)i^i describes an infinite geodesic 



passing through Lo,5 in the tree (cf. 8.2 (1) or more generally (8.6)), 



SO3(Q5).i^0,5 - S03(Q5)/S03(Z5). 

Up to orientation, any such geodesic may be encoded by an infinite non- 
backtracking word in 

where the i-th letter connects the {i — l)-st element of the geodesic along an 
edge of the tree to the i-th element. In the present case, the word associated 
with the sequence (-Li,5)iez is the word {w~^)i^i satifying Lj^s = w~^Li^i^'^] 
equivalently, {wi)i,=.z is the word corresponding to the trajectory of x defined 
in ^2]6} 

Suppose that two points x, x' G M'd give rise to the same truncated word 
of length 2t 

M^(^) : [-1 + 1,1] {^^\S^\C±n =-^5. 

This means exactly that the geodesies SOx(Q5).-/^o,5 and SOx'(Q5).io,5 in 
the Bruhat-Tits tree coincide "from times —i to I " or in other terms 

In particular x' E L^5nL_£5. The last intersection is a sublattice of Lo,5 of 
index 5^^; more precisely, it is the preimage in Lo,5 of the line generated by 
x(mod5^) in Lo,5/5^-^o,5- Thus x and x' are linearly dependent in Lq/S^Lo, 
and since both vectors have norm d, we have x = itx' modulo 5^. This 



concludes the proof of Proposition 2.10 □ 



8.4. Proof of Proposition 2.13, We assume some familiarity with the 



theory of automorphic forms; in any case, we refer to Lubotzky's book 



L ub94[ Chap. 6 & Appendix] . 

The space L'^{PB'' {Q)\P'B'' {Af)/Kf[3,q]) admits an orthogonal decom- 
position into eigenspaces of the commutative algebra generated by Hecke 
operators. These eigenspaces are the set of PB^ (M)i('j[3, g]-invariant vec- 
tors of automorphic representations on PB^. Such representations are of 
two types: 

- one-dimensional representations; 

- infinite dimensional representations. 



The latter corresponds, via the Jacquet-Langlands correspondence JL70 
to automorphic representations of PGL2 which are discrete series of weight 
2 and unramified outside 2, 3 and the primes dividing q (more precisely, one 
can check that the conductor divides 18. g^). From the work of Deligne (or 
rather Eichler/Igusa/Shimura since this is weight 2), the eigenvalue of the 
standard 5-Hecke operator for such spaces is bounded in absolute value by 

2^/5. 
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As for the former: each such is the representation of PB^(A) on the 
one- dimensional subspace generated by the function 

<7GPB><(Q)\PB><(A)^x(Nr(5)) 

where x '■ Q^\A^ i— ?• {±1} is some quadratic character. The action of 
PB^(Q) on such a representation is trivial, as is the action of Kf[2>,q\ (by 
definition) ; moreover since the elements of G come from quaternions of norm 



1, the action of G is trivial as well; hence from (8.3), such representation 
has to be the trivial one. It follows from this enumeration that —6 does 
not occur as an eigenvalue of the 5-th Hecke operator so the graph, while 
connected (cf. above) is not bipartite. 

Remark. The discussion above is also valid for Md{q): this follows imme- 
diately from the previous discussions by projection. In particular we have 

r(g)\PB><(Q5)/i^5 ^ PBX(Q)\PB><(A/)/K}[g] ^ M'M 
with r(,) =PB><(Q)ni^'}[(?]PBX(Q5)• 
9. Expander graphs and random walks 
The contents of this section follow lecture notes of Hoori, Linial and 



Wigderson ^HLW06j. Our goal is to prove Proposition 2.14 

9.1. Let ^ = iy,E) be a (possibly directed) d-regular graph on \V\ = n 
vertices, i.e. the number of incoming edges to each vertex is d, and the 
number of outgoing edges is also d. We assume d > 2. The normalized 
adjacency matrix T of 1^ acts on L'^{V) by 

{xi-^y)GE 

By an abuse of notation, we will use Lp'{^) and L'^iV) interchangeably. More 
generally, ^ may be allowed to have multiple edges and loops, in which case 
we modify the definition of T in the evident way. 

Let ||r|| be the operator norm of T acting on the orthogonal complement 
of the constants in L?'iy). The graph is said to be an a-expander, for 
some a < 1, if ||T|| ^ 1 — a. In rough terms, the smaller ||r|| is, the more 
"strongly connected" the graph ^. 

When we speak of a "random walk on we mean that we select a vertex 
uniformly and randomly from V , and then proceed to walk along directed 
edges, at each stage choosing one of the adjacent edges one with each choice 
assigned probability 1/d. 

Lemma 9.1.1. Let Qi, . . . ,Q£ be subsets ofV, with densities fii := \Qi\/n. 
The probability that a random walk on ^ is in Qj at step j, for all 1 ^ j ^ i, 
is at most 

-1 

(\//W+i + ll^l 

1=1 



n 
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Proof. Let XQi be the characteristic function of Qi, and Ai be the endo- 
morphism of L'^{V) defined by / i— )• XQif- Let 11 denote the projection 
onto the constants, so that Til = 11. The endomorphism Ai.T.Ai^i may be 
decomposed: 

AiTAi+i = Ai.U.Ai+i + AiT{l - U)Ai+i. 

The endomorphism Ai.H.Ai+i may be written as f ^ j^(/)XQi+i); and 

thus has operator norm Since the operator norm of Ai.T{l — 

n).Aj_|_i is at most ||r||, we conclude that the operator norm of AiTAi^i is 
at most ||T|| + (fJ.ifJ^i+i)^^'^ ■ The probability that a random walk visits Qi at 
step i for every i G {1, ...,£} is given by 

\V\-\lv, {AiTA2)(A2TA3) . . . {A,_iTAe)lv) 
and the result follows. □ 

Lemma 9.1.2. Let Q be a subset of V, with fj, := \Q\/n, and let j be a 
random walk of length L Then the probability that I7 n Q| ^ (/^ + e)^ is at 
most 

ci exp(— C2^) 

for positive constants ci,C2 depending only on d, \\T\\,fi,e. 

In other words, the number of "bad" walks of length i, with respect to 
some fixed notion of "bad" , decays exponentially with i. 

Proof. The constants Ci,C2, . . . appearing in the proof are all understood 
to be positive constants depending only on d, \\T\\, /i, e. 

Let 5 be a subset of {1.-^} of size k. It follows from Lemma 9.1.1 that 
the probability that ^i £ Q for i £ S and 7i ^ Q foi' ^ ^ 5 is at most 

Ci/.^(i-^)^-'=(i + — J^L)^ 

min(^, \ — 

Summing over all choices of S we have that the probability that I7 H QI = k 
is at most 

c.Qm'(i-.)"(i + -J^)' 

On the other hand, the sum 



-k 



is at most exp(— 6*2^). Thus, we are done if ||T|| is small enough that (1 + 



min(/i,l— /i) ' 

if this is not the case, we fix an integer C3 ^ 1, and replace the graph 
by the graph ^("-^3) for which a directed edge from x to y corresponds 
to a directed path of length C3 in the graph . (Note that ^^('-^s) may 
have multiple edges and loops.) This improves the spectral gap: if T^'^^^ is 
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the normalized adjacency matrix of l^^'-'^^ we have T('^3) = t^z _ Accord- 
ingly, ||T*^*^^)|| = ||r|p3. Choosing C3 large enough, the argument above 
shows that the probability that a random walk of length H. on the graph 
c^(C3) Qf remains within Q for at least (/i + e)i steps is bounded above by 
C74exp(-C5^). 

It follows immediately that the probability that a random walk on of 
length C^i spends time ^ + e)C'il inside Q is at most C4exp(— Cs^). 
This proves the desired claim. □ 



9.2. The arc graph. Lemma 9.1.2, which tells us that an exponentially 



small proportion of random walks are poorly distributed in will not quite 
suffice for our purposes; what we need to know is that an exponentially small 
proportion of non-backtracking walks are poorly distributed in ^. In this 



section we explain how to derive such a statement from Lemma 9.1.2 

We now assume ^ to be symmetric; that is, E is closed under reversal. 
For each edge a £ E, write a"*" for the target of a and a~ for the source of a. 
We denote the reversal of a by a. With these notations, define the arc graph 
S^' to be a directed graph whose vertices are the directed edges, or arcs, of 
?f . There is an edge from the arc a to the arc b exactly when a"*" = b~ and 
a / 6. 

Thus, is regular of degree d — 1. We denote by T' the normalized 
adjacency matrix of J^': 



d-1 

b-=a+ 

The key feature of the arc graph for us, is that we have a natural bijection 
between non-backtracking paths of length i on^, and paths of length i — 1 
on 

9.3. Our goal will be to deduce a spectral gap for T' from that for T. This 
is a simple analogue of Atkin-Lehner theory in the subject of modular forms: 
when d = p + 1 for some prime p, we can think of as a quotient of the 
Bruhat-Tits tree attached to PGL2(Qp)/PGL2(Zp); then the arc graph of 
the Bruhat-Tits tree is obtained by replacing PGL2(Zp) with an Iwahori 
subgroup, so that the passage from graph to arc graph is much the same as 
the passage from the congruence subgroup ro(A^) to the smaller subgroup 

ro(A^p). 

There are natural maps B,E : — t- L^(J^') ("beginning" and "end") 
defined via 

Bf{a) = fia-), Ef{a) = fia+). 

Moreover, the orthogonal complement to Im(i?) © lm{E) consists of those 
functions F G L^(J^') with the property that 



E 
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for all V E . On this orthogonal complement (the "new space") , the 
operator T' acts via 

(9.1) "^-^Z' 
where F{a) = F(a). Moreover, one checks that 

{Bfi,Ef2) = d{Th,f2), T'oB = E. 

Thus, if w G L'^{W) is an eigenfunction for T with eigenvalue A, then the 
"old space" C{Bw) + C{Ew) is stable under T' . Prom this we see that every 
eigenvalue of T' on this space is also an eigenvalue of the matrix : 

1 

-1 dX 



(d-i) (d-i) 

It is easily computed that the eigenvalues are bounded away from 1 if A is. 



By (9.1), the eigenvalues of T' on the new space are bounded in absolute 
value byl/((i— 1)<1. We conclude that ||r'|| is bounded away from 1 if 

||r|| is. 

Proposition 9.4. Let = (V^, F) he an undirected graph with \\T\\ < 1, 
and let Q be a subset ofV, with fj, := \Q\/n. The probability that a random 
walk without backtracking of length i spends more than {fi + €)i time in Q 
is at most 

ci exp(-C2^) 

for constants ci,C2 > depending only on d, \\T\\,fi,e. 



Proof. Let Q' C Vi^' be the subset of arcs w hose i nitial vertex lies inside 
Q. Noting that |^ = jyj, we apply Lemma 9.1.2 to {^',Q') taking into 



account that ||T'|| is bounded away from 1 in terms of ||T||. □ 



Part 5. Related topics 

10. Miscellanea and extensions of the ergodic method. 

In this section we comment on various modifications that can be made in 
the assumptions or the proofs. 

10.1. The squarefreeness assumption. Through this paper, we have as- 
sumed that d is squarefree. This is mainly for simplifying the exposition. 
It is true in general that some quotient of J^d by a subgroup of SO^CZ) is 
acted on by the ideal class group of some quadratic order containing \/—d. 
However, when d is not squarefree this action is not necessarily homoge- 
neous: there may be several orbits. This has to do with the existence of 
non-primitive elements in An element x = (a, 6, c) E J^d is primitive 
iff the integers a, b, c are coprime; when d is squarefree, every element of J^d 
is primitive. On the other hand, if we choose d of the form do/^, the subset 
f'^'^do C J^d forms a separate orbit of the class group action. The appropri- 
ate repair is to consider just the set of primitive elements Jiff C Jifd] these 
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elements again form a homogeneous space for a certain class group. By the 
methods of this paper one can then show the equidistribution of icedq{J^J') 
or redoo(=^^) as d — )■ oo. Using the decomposition 

^d=\_\ f-^dip 

one can conclude that redq(J^) and redoo(=^) are indeed equidistributed. 
10.2. Distribution of trajectories. One of the important ingredients for 



the proof of Theorem 1.4 has been to consider the trajectories associated 
to the various points in Md and to show that these trajectories are, in a 
suitable sense, "well-spaced" . 

The principle of lifting a point to its trajectory and then of studying 
the distribution of the trajectories occurs in several recent applications of 
ergodic methods to number theory. For instance, a (much less evident) 
version of this principle occurs in the proof of the arithmetic quantum unique 
ergodicity conjecture, in which the role of the lifted trajectories is played by 
the "microlocal lifts" [Lin06| . 

In fact it is possible to extrapolate the above principle to showing that 
segments of trajectories of hounded length become equidistributed. For in- 
stance, in the context of Theorem |1.3[ one can prove the following result: 



Theorem. Under the assumptions of Theorem 1.3, fix £q > 0, and let 7o^°'' 



be a fixed non-backtracking marked path on J^diojof length 2£q then the 
cardinality of the set of x £ for which 

is equal to 

3.5^'-W)| l^'l" + °"»-''^°°- 

(Note that 3-52'^o-i|J^(g)| is the total number of non-backtracking paths 
on J^d{q) of length 21q + 1.) 



In combination with Proposition 2.10 this implies that is well-distributed 



5-adically, as well as mod q. This may be seen as the analogue of Duke's 
theorem on the equidistribution of closed geodesies on the unit tangent bun- 
dle of the modular surface Xo{l) (cf. |Duk88] [Theorem 1]). See |ELMV09b 



for a purely dynamical proof of the latter, following principles similar to the 
ones of the present paper (although in a different language). This theorem 
also admits a variant in the context of Theorem 11.11 and uniform versions 



analogous to Theorems 1.2 and 1.4 



10.3. Application to mixing. As an application of the uniformity reached 
in Theorems 1 1 . 1 1 and 1.4 we have the following mixing type results: 
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Corollary. Fix 6 > 0. Let q be a fixed integer coprime with 30. For each 
squarefree d = ±1(5) and coprime with q, let he a primitiv^ ideal of 
such that N(p(i) < d}/'^^^ ; we suppose moreover that as d ^ +oo, N(prf) — )• 
+00. Then, for d — ?• oo the set 

(10.1) {redoo(x, [Prf].i), ± G S03(Z)\^rf} C SO^{'L)\S^ x SO^{'L)\S^ 

becomes equidistributed on S03(Z)\S'^ x S03(Z)\S'^ with respect to the prod- 
uct of Lebesgue measures on each factor. Similarly, the multiset 
(10.2) 

{red,(i, [pj.x), X e SOs{Z)\J^d} C S03{Z)\Jif'd{q) x S03(Z)\jrd(g) 

becomes equidistributed on S03(Z)\.^(g) x S03(Z)\.^(g) with respect to 
the counting measure. 

Notice that if the norm N(prf) remains constant for d varying (possibly over 



a suitable subsequence), a simple variant of Theorem 1.2 above shows that 



the set ( 10.1 ) (resp. ( 10.2 )) becomes equidistributed along some codimension 
two subvariety of (S03(Z)\S'^)^ (resp. a multiset of order N(prf)g^ in 
(S03(Z)\=^(g))^ ), namely the graph of the N(pd)-th Hecke correspondance. 
This explains the condition N(p^) — t- oo. 

In |MV07| , the second and third authors, proposed a "mixing conjecture": 
namely the quoted statement should remain true so long as N(prf) tends to 
cxD as d — 7- oo. Not surprisingly, the range not covered by the corollary 
(i.e. ^ N(pd) < d^/^logd for < e{d) 0) is by far the most 

interesting range of the conjecture; for instance, the applications sketched 
in 



MV07 depend crucially on this range. 



10.4. Extensions to other quadratic forms. For the indefinite discrim- 
inant quadratic form x'^ — yz, results analogous to Linnik's theorems have 



been obtained by Skubenko using the crgodic method (see Lin68, Chap 
VI] and the references inside). In [ELMV09bj , the second and third au- 
thors together with M. Einsiedler and E. Lindenstrauss gave an alternative 
exposition of the ergodic approach which in fact allows for stronger results. 

More generally, Linnik's ergodic method carries over when one replaces 
the quadratic form x'^+y'^+z'^ defined over Q by any non-degenerate ternary 
quadratic form Q defined over a fixed number field F. For instance, suitable 



generalizations of Theorems 1.1 and 1.3 over number fields have been ob- 



tained by Teterin [Tet83| . All the main theorems of the present text remain 
valid when suitably adapted to this more general context. In this setting, 
the condition d = ±1 modulo 5 is to be replaced by the existence of a fixed 
prime ideal in F which splits in F{^y—ddisc{Q)). 



10.5. The discriminant aspect. Theorems |1.4| and 1.2 give results that 
have nearly optimal uniformity in the modulus q or in the radius of the ball 
p. However, the ternary form Q whose representations of integers we study 
is fixed as Q{x,y,z) = x'^ + y'^ + z^ . It would be interesting to investigate 

primitive ideal is an integral ideal of minimal norm in its ideal class. 
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the option of proving analogues of these results where Q is allowed to vary, 
with the aim of obtaining optimal uniformity in the discrimant disc((5). In 
particular, if the form is definite, its genus grows on the order of disc(Q). 



One would envisage an analogue of Theorem 1.4 that establishes that almost 
every Q' G genus((5) represents d (if everywhere locally representable) , so 
long as |genus((5)| ^ d^l"^^^ . 

One example where this can be carried out completely is when Q is the 
reduced norm on the space of trace zero elements in the quaternion algebra 
which is ramified at oo and at some prime q> 2: then |genus((5)| ~ In 
that case the equidistribution results may be interpreted in terms of super- 
singular reduction of CM elliptic curves (cf. the paper of D. Gross |Gro87| ): 

Let denote the set of isomorphism classes of supersingular elliptic 
curves over F^; these are in fact defined over Fq2. Then is canonically 
identified with genus(Q) and so has size ~ (7/12. For K an imaginary qua- 
dratic field in which q is inert, let Sffj^ denote the set of elliptic curves in 
characteristic with complex multiplication by ffx ] these curves are defined 
over the Hilbert class field of and 

|f^^| = |disc(ir)|V2+°{l). 

For q, a place in Q above p, we have a "reduction mod q"-map 



redq : ^ Sg' . 



For any E G define 



1 \^ed-\E)\ 



where 

l/|Aut(E)| _ 



EE'e^,»^l/|Aut(i?')r 

^ defines a probability measure on <5|*. 

An analog of Theorem |1.4| in this context is the following: 

Theorem 10.6 (Equidistribution of "Grauss" points). Fix 6,r] > 0. Let q 
he as above, and let p, be a fixed auxiliary prime with {p,2q) = 1. Let K 
range over the imaginary quadratic fields such that 

(1) q is inert in K , 

(2) p is split in K. 

Set d = |disc(K)|. As long as q ^ d^^'^^^ , the fi-measur^ of E £ E^" such 
that 

|devrf(E)| > T] 

tends to as d ^ +oo. 

^one could replace "the ^-measure of" by "the fraction of" since 1 ^ Aut(i5) ^ 12 
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The proof is similar to that of Theorem 1.4, what is needed is the analog 
It is proven by a suitable generalization of the geometric 



of Corollary 2.12 



part of the Gross formulas |Gro87 Prop. 10.8]. As noted in |Mic04| , using 
the formulas of Gross and subconvex bounds for L-functions, one can prove 
a version of this result, with //-measure and without the condition ([2]), as 
long as q is less than a small positive power of d; under GRH this may be 
further extended to q ^ d^^^~^ . 



10.7. Representation of quadratic forms by quadratic forms. As we 

briefly discussed in ^ the problem of studying the representation of an 
integer by a quadratic form admits another sort of generalization. Namely, 
one can ask about representations of a varying integral quadratic form of 
rank m by another, fixed, quadratic form of higher rank n ^ m. When 
m = 1, n = 3, and the rank-3 form is + + z^, we are in the situation of 
the present paper. The general problem basically splits into two questions: 

- the existence of such representations; 

- how these are distributed (when they exist) in real or p-adic topolo- 
gies. 

As is explained in [EV08] , both the existence and the distribution problem 
can be understood in terms of the distribution properties of adelic orbits of 
some algebraic subgroup H inside an adelic homogeneous space associated 
to some ambient algebraic group G D H. More precisely, we can take G to 
be a form of SO^ and H a form of SO„_m. 



The work EV08 considered cases for which n — m^S. In that case, since 
noncompact forms of SOn-m have plenty of unipotents, one can apply the 
results of M. Ratner to show that (under suitable technical hypotheses) the 
adelic orbit of the smaller group equidistributes in the adelic quotient of the 
larger group; this suffices to prove the existence of such representations. The 
second question, regarding equidistribution, was not addressed in EV08 but 
likely follows from similar methods. 

This is in sharp contrast with the present situation where n = 3, m = 1. 
In this case, the smaller group is a torus (a form of SO2); in particular, 
Ratner's theorems do not apply, and one needs to use the special methods 
described in the present paper. It should be noted, moreover, that we are 
helped by the fact that SO2 C SO3 is a maximal torus; it seems that the 
problem of studying representations of, say, forms of rank 2 by a given form 
of rank 4, would be significantly harder. 



11. Harmonic analysis and Linnik's method. 

11.1. Duke's approach via harmonic analysis. Although our main con- 
cern is with extending Linnik's method, we briefly remark on Duke's method 
and the relevance of modular L-functions to our problem. 

As we have remarked, the condition "d = ±1 modulo 5" has the effect 
of ensuring that 5 splits in Q{y/—d); one could replace the role of 5 by any 
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other fixed prime, as did Linnik. The problem of removing such a constraint, 
however, proved difficult. This question, i.e.: 

(11.1) Can we guarantee the existence of a small p with (^"^^ = 1? 

is a very difficult and deep one, for it is intimately connected to the issue 
of the Landau-Siegel "exceptional" zero, i.e. to effective lower bounds for 
the class number of Q{\/—d). For instance, observe that proving that there 
exists a prime p as in ( 11.1|) with p ^ d'' would immediately show that the 



class number of Q(v — d) tends to oo as d — )• oo; itself, far from a trivial 
result. 

It was therefore a considerable breakthrough when W. Duk^ Duk88 
DSP90j established Theorem 1.3 and Theorem |1.1| without the auxiliary 



assumption. Duke's original approach used the Maass-Shimura theta cor- 
respondence to express the Weyl sums attached to these equidistribution 
problems, i.e. 

for if is a (smooth) function on (resp. J^d{Q)) with zero mean with respect 
to the Lebesgue (resp. the counting) measure, in terms of Fourier coefficients 
of some half-integral weight holomorphic form 9{(p) |Wal81|; indeed, 9{ip) is 
a classical ^-series. The decay of the Weyl sums then followed from non- 
trivial bounds for these Fourier coefficients, which were obtained using in a 



key way ideas and results of H. Iwaniec Iwa87 . 



11.2. Central values of automorphic L-function: Waldspurger's for- 
mula. It was quickly realized that these problems are also related to L- 
functions and to the problem of bounding them non-trivially at the central 
point: if ip is an eigenfunction of (almost all) the Hecke operatora^ acting 



on S (resp. J^(g)), then by a formula of Waldspurger Wal85| one has 



, .X \W{ip;d)\-' dV2 L(7r,l/2)L(7r0Xd,l/2) , . . . jr . . . . 

{ip,<f) yj^dY L7r,Ad, 1/2 \^ 

pm 



where 



( , ) is the inner product associated to the Lebesgue (resp. counting) 
probability measure on 52 (resp. J^d{q)) and c > is an absolute 
explicit constant; 

TT is the automorphic representation of PGL2 in Jacquet-Langlands 
correspondence with the automorphic representation of SO3 = PB^ 
generated by vr is a discrete series representation (i.e. correspond- 
ing to an holomorphic modular form) of conductor dividing 18g^. 
Xd is the Kronecker symbol associated with the quadratic field Q(\/— d); 



^partly in collaboration with Schulze-Pillot 

^'^This is not a restriction, since the space of functions on S"^ or Mdiq) is generated by 
such Hecke eigenforms. 
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- the factors Ioo{v,d), Ip{ip;d), p\6q are some local integrals (real or 
p-adic) which can be bounded explicitly in terms of if but indepen- 
dently of d. 

From this formula, one sees that the decay of W{ip;d) as d — )• oo is a 
consequence of the so-called subconvex bound: there is some absolute 5 > 



(11.3) 



Such a bound was proven, for the first time, in [Iwa87| . A few years later, a 
more general bound (with 1/2 replaced by 1/2 + it) was obtained by Duke, 
Friedlander, Iwaniec by an entirely different method [DFI93| (again without 
any condition on d). 

Remark. The method of Duke has been generalized by Duke and Schulze- 
Pillot to general ternary quadratic forms over Q (DSPQOl. More recently. 



the alternative approach involving Waldspurger's formula (11.2) has been 



extended to treat ternary forms over general number fields, in particular, 
for totally definite ternary forms over totally real number fields: this has 



been carried out by Cogdell, Piatetsky-Shapiro and Sarnak CogOS ; in this 



generality the required subconvex bound may be found in Ven05 . For more 



comments about this we refer to |MV06 and to |BH09 where a complete 



exposition of these arguments is given along with sharp form of (11.3) 



11.3. Ergodic Theory vs. Harmonic Analysis. Now let us compare 
what the ergodic and harmonic- analysis techniques yield, as regards to The- 
orem |1.4| and |1.2[ It is possible to show that the product of the local inte- 
grals looi^'i d) HpiGg ^pi'^'-' d-) is bounded in terms of some Sobolev norm on ip. 



Moreover, the dependence on vr in the bound (11.3) is polynomial in Q{'Jt) 

From 



the analytic conductor of vr (as defined by Iwaniec and Sarnak [ISOOJ 
this one can deduce, without any condition on d that, for every x (resp. ev- 
ery X G S*^), |devrf(x)| (resp. \deVii{Q{x, p))\) is bounded by a small negative 
power of d as long as q (resp. p~^) is less than a (small) positive power 
of d (Bb04l[D^Ik05l|GF87| . As we explain below, assuming the Generalized 
Riemann Hypothesis (GRH), this result holds when q (resp. p~'^) is less than 
^ d}/^"" for any fixed > 0. 

We consider for simplicity the case of M'dio)- Let B denote an orthogonal 
basis in the space of functions on J^(q). The average 



E 



is independent of B. Therefore, (picking as a basis {Jx, x G J^(g)}) such a 
sum equals 

E k«d-i(x)|2. 
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Taking B of the form {1}U Bq where Bo is constituted of Hecke eigenforms, 
we obtain 



- (red„ (x) 



(if, if) ' ^ ^ ' ' \^d{q)V 



Hence by ([1L2|), 

^^>^dev,(x) =cdl ^ L(,,Ad,l/2) ^oo(v^;d)n/,(v.;d). 

The Generahzed Riemann Hypothesis and a local computatioip] shows that 
for any e > 

(11.4) 5^devrf(x)2 (^qdfq^ld^^ 

X 

Linnik's basic Lemma alone shows the same result with an additional factor 
(1 + d/q^) on the right-hand side; in particular, (11.4) is known uncondi- 



tionally for q > d. Now (11.4) shows, for > 0, that: 

- devd(x) ^ d^'^ for any x G J^diQ) as long as g ^ d^^^~'^ (thus 
proving the surjectivity of the reduction map redg in this range); 

- given 5 > 0, the fraction of x G J^dig) for which |devd(x)| > (5 is 
bounded by <^s d~^'^ as long as g ^ d^^^~'^~'^. 



In particular we recover Theorem 1.4 On the other hand we don't see at 



the moment, even under GRH, how to provq^ that there are no x for which 
red"^(x) = in the range d^/^ ^ g ^ d^^*'". 



Remark 11.1. It should be noted that (11.4), obtained by pointwise ap- 
plication of GRH, is essentially a "sharp on average" bound for the family 
T of L- values 

{L{'ir, l/2)L{TT(S'Xdj 1/2), tt of conductor 18g^ and trivial central character}. 

The size | of that family is w while the conductor C of any of these L- 
functions is of size ~ q^d'^. In particular, in the range q^ ~ d, we have ~ 
|C|^/^. Starting with the original work of Weyl, there are, by now, several 
examples of families of L-functions satisfying a similar relation between the 
size of the family and the size of the conductor and for which the central 
values could be bounded sharply on average without any hypothesis [Wey21 



CIOO Li08 



^"'^to bound the local integrals Ip{tp;d) 

^■^This type of situation is not uncommon; for instance, it is known that the GRH 
implies that the smallest prime in an arithmetic progression with modulus g is ^ c{e)q'^'^'; 
while we surely expect the result with 2 + e replaced by 1 + e, no proof of this is known 
even under GRH. 
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